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CHAPTER I 


` HYPOTHESIS AND PURPOSE 


O NE of the most striking deficiencies 
in previous studies of personality 
rigidity phenomena revolves about the 
fact that with but a few limited excep- 
tions they have all confined themselves 
to measuring rigidity in terms of very 
simple motor or perceptual factors. No 
attempt has been made to sample 
rigidity behavior simultaneously at dif- 
ferent levels of complexity or at differ- 
ent levels of emotional involvement. 
Consequently, there has been little basis 
for deciding such fundamental questions 
as to how generalized rigidity may be in 
the individual personality or whether 
there are distinctly different kinds of 
rigidity. This failure has at least in part 
been due to the fact that no one has 
formulated a general hypothesis which 
would define what is common to rigid 
behavior at different levels. For example, 
one may easily say that perseverative be- 
havior is a sign of rigidity when the per- 
formance of some simple motor task is 
involved; but how does one apply this 
kind of simple perseveration concept to 
the detection of rigidity in such complex 
behavior as an individual's aesthetic 
preferences or his interpretation of an 
event with emotional significance? The 
Present study was undertaken to attack 
Some of these problems—to explore in a 
broader sense the whole question of per- 
Sonality rigidity. In line with the above 
discussion it was necessary first of all to 
develop a hypothesis that would define 
how rigidity might manifest itself in 
diverse responses to diverse materials. A 
Presentation of this hypothesis and of 
the specific objectives of the study follow. 

The hypothesis that forms the basis for 
the measurement procedures utilized in 


this study is that there are persistent 
personality rigidity trends which are 
relatively independent of intelligence.t 
It will be assumed that such ‘rigidity 
trends reveal themselves in the degree to 
which any given individual is able to 
indicate in some behavioral way his 
ability to utilize alternate modes of re- 
sponse when dealing with problems or 
situations requiring adjustment. That is, 
it is postulated that given, for example, 
two individuals, A and B, of equal 
formal intelligence, who are faced with 
the necessity of solving some problem 
with perhaps six alternative modes of 
solution, that A will be able to utilize 
sall six of the possible modes of solution, 
and B will utilize perhaps only two of 
them, because of personality differentials 
in rigidity. In these terms an individual's 
rigidity in expressing aesthetic prefer- 
ences, or his rigidity in interpreting a 
picture with emotional meaning, or his 
rigidity in sorting varied geometric ob- 
jects into different classes are all measur- 
able on a related continuum—viz., the 
number of his aesthetic preferences, the 
number of his equivalent picture in- 
terpretations, and the number of his 
equivalent sortings. This reduction of 
the measurement of rigidity to number 
of equivalent alternatives utilized is a 
simplification dictated by the needs of 
experimental control; but it is also a 
kind of simplification that is based on a 
concept that has been implicit in almost 
all past rigidity measures. 


1 That is, intelligence in a limited formal or 
psychometric sense. It is recognized that intellec- 
tual factors pervade most areas of reaction. The 
assumption here is that intelligence in the for- 
mal sense is not significantly related to rigidity 
behavior resulting from the particular character 
of the personality structure, 
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It is important that it be clearly 
understood to what degree this brief 
hypothesis really does neglect the com- 
plexity of factors involved in rigidity 
phenomena. One suspects that if suffi- 
ciently sensitive measuring instruments 
were available, it would be possible to 
analyse rigidity phenomena in many 
different dimensions. Thus, Aypotheti- 
cally one might be able to measure 
rigidity as it affects perception of situa- 
tions, as it affects subjective reactions to 
situations, and of course as it affects 
overt behavioral reactions to situations. 
Furthermore, one might be able to de- 
Scribe rigidity in each of these dimen- 
sions in terms of a large number of 
descriptive continua (e.g., quickness with 
which evoked, degree of persistence after 
arousal and degree of generalization). . 
Ideally, it would be well to measure as 
many of these phases of rigidity as possi- 
ble. But it has been necessary here to 
treat the problem in a simpler fashion: 
to confine postulations to overt be- 
havioral manifestations of rigidity and 
to limit them to rigidity manifestations 
conceived to exist on a single restricted 
continuum (viz, number of utilized 
alternatives). This has been true simply 
because there are so few known tech- 
niques available for getting at rigidity 
phenomena in any dimension and be- 
cause the demands of experimental con- 
trol place a limit on the number of 
variables one may consider simultane- 
ously. 

Of course, the emphasis that has been 
placed here upon the overt behavioral 
aspects of rigidity phenomena is not en- 
tirely the result of limitations raised by 
experimental control and available 
modes of measurement. It seems to be 
justified to some extent by the fact that 
in everyday situations we judge others 


to be rigid or flexible in terms of their 
overt behavior.. The flexible person 
shows that he is flexible by dealing with 
situations in a manner that can be de- 
scribed as "flexible." The rigid person 
actually behaves rigidly in his overt re- 
sponse to various stimuli. Indeed, it is 
questionable whether теге subjective 
awareness of alternatives in a situation 
is a significant index of personality flexi- 
bility. 

This project has the following ob- 
jectives: 

1. To set up a battery of choice or 
problem situations in which there are 
variable numbers of roughly equivalent 
modes of response, and then to de- 
termine whether individuals show any 
measurable difference in the degree to 
which they utilize few or many alterna- 
tives from situation to situation. The 
range of situations used in the battery is 
Such as to include reactions to materials 
varying in meaningfulness, complexity, 
and emotional involvement. 

2. To relate any observed individual 
consistencies in grasping and utilizing 
alternatives to measures of personality. 
The personality measures include the 
Rorschach test and Guilford'ss STDCR 
(12). 

3. Furthermore, to determine whether 
observed rigidity consistencies are corre- 
lated with different “types” of functional 
maladjustment. This is attempted by 
using subjects who are psychiatrically 
diagnosed as conversion hysterics, and 
paranoid Schizophrenics, and also nor- 
mal subjects. 

Summarizing and restating the above 
in another form, it may be said that the 


* A questionnaire which contains 175 questions 
pertaining to subjective feelings, degree of So o 
participation, mood changes, etc. It presumably 
measures five personality factors. 
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Study aims at answering such questions 
as the following: а $ 

1. Are individuals consistently rigid? 
That is, do they show a constant level 
of rigidity that cuts across a wide range 
of their responses? 

2. Are there distinctly different kinds 
of rigidity? If so, what is the relative 
importance of cach in the personality 
structure? 

3. Do individuals who are in general 
emotionally constricted (as indicated by 
the Rorschach test) show a correspond- 


ing rigidity in other phases of their be- 
havior? . 

4. Does an individual's manner of de- 
scribing himself (as indicated by the 
Guilford STDCR) have any relationship 
to rigidity? 

5. Do neurotics show specific rigidity 
trends not manifested by normal sub- 
jects? In what way is the rigidity of the 
paranoid schizophrenic similar to, or 
different from, the rigidity of the con- 
version hysteric? ; 


СНАРТЕК П 


SUBJECTS 


S subjects participated in the 
project. They comprised three 
distinct groups: twenty normals, twenty 
conversion hysterics, and twenty para- 
noid schizophrenics. To hold the sex 
factor constant only women subjects 
were used. An attempt was made also to 
keep the intelligence factor relatively 
constant by selecting only those persons 
whose intelligence scores fell roughly 
within the 95 to 110 IQ range. Actually, 
when final IQ means were computed for 
each group it was found that the normal 
subjects exceeded the hysterics by six 
points and the paranoids by almost five 
points. There was not a significant differ- 
ence between the paranoids and hys- 
terics. As will be shown later, the influ- 
ence of the intellectual factor on the 
total results was negligible. Each group 
will now be described in detail. 


Normals 


The normal group consisted of nine women 
who were in their third or fourth year of train- 
ing as occupational therapists; five student nurses 
in their last year of training; two student social 
workers in their first year of graduate work; and 
four individuals employed as stenographers in a 
hospital setting. The age range was from 20 to 
45—with the majority falling in the twenties. 
The stenographers in the group were obtained 
when they responded to a posted notice, but all 
others were selected from larger available groups 
because they fell within the 95-110 IQ range. 
No attempt was made to screen the group in 
advance for possible pathology. It was assumed 
that the individuals in the group were "healthy" 
or "normal" because they were not hospitalized 
and appeared to be functioning at their partic- 
ular jobs without any gross signs of maladjust- 
ment. In general, the participants were all very 
cooperative during examination and expressed 
much spontaneous interest toward various of 
the tasks. At no time was the real purpose of the 
testing revealed. The various procedures were 
simply described as a series of tests that were 
being given to different groups for comparative 
purposes. With but a few exceptions all testing 
was completed in one session. 


TABLE 1 


DISTRIBUTION OF AGE IN YEARS, EDUCATION IN YEARS, INTELLIGENCE, 
AND OCCUPATION IN THE EXPERIMENTAL GROUPS 
(N=60 Women) 


ч Age Intelligence Education 
roup Occupati 
Mean | Range | Mean | S.D. | Mean | Range куо 
NORMALS 24 20-45 | 107.2* | 3.8 | 15 12-16 9 occtipational therapy students 
5 student nurses 
2 student social workers 
4 stenographers 
sss 35 | 18-50 | 100.6 | 4.9 | ro 8-12 | 9 housewives 
RICS 4 stenographers 
7 in miscellaneous factory and 
clerical work 
PARANOID 37 | 21- i 
55 | 102.6 5.3 | 10.8 7-16 | ro housewives 
SCHIZOPHRENICS 5 clerical workers 
1 florist 


4 miscellaneous factory work 


a Exceeds hysterics and paranoids significantly. 
b Not significantly different from the paranoids. 
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Hysterics 


"The hysteric group consisted T twenty women 
who had been diagnosed "conversion hysteria” 
at the Illinois Neuropsychiatric Institute. Nine- 
teen of these women were outpatients and one 
was an inpatient. The age range was from 18 
to 5o, with the mode in the late thirties. А 
wide variety of occupations was represented in 
the group; but the "housewife" category tended 
to predominate. The educational level of the 
group was significantly inferior to that of the 
normals; and a year or two of high school was 
the most frequently indicated. Most of the 
group was told that the testing procedures rep- 
resented part of the routine psychological test- 
ing program for patients. However, in some 
instances due to special circumstances the test- 
ing was represented as an “experimental project 
with particular interest in your type of case.” 
The general level of cooperation in the group 
was good. In the few instances where subjects 
showed themselves to be unusually uncoopera- 
tive testing was discontinued and the results 


were not used. For about half of the subjects 
it was necessary to utilize two separate sessions 
before the test series was completed. 


Paranoids 


The paranoid group consisted of twenty 
women psychiatrically diagnosed paranoid schi- 
zophrenics. Nine of these subjects were obtained 
at Elgin State Hospital, five at Chicago State 
Hospital, and six from the Illinois Neuro- 
psychiatric Institute (three inpatient and three 
outpatient). The age range was from 21 to 55, 
with a mode in the late thirties. The educational 
and occupational representation was similar to 
that of the hysteric group. The general level of 
cooperation was significantly poorer than was 
the case with the other groups. Of course, all 
subjects used were sufficiently cooperative so 
that they were willing to undergo a number of 
hours of testing. Most of the group required at 
least two sessions and several as many as three 
sessions before testing could be completed. 


CHAPTER III © 


GENERAL PROCEDURE AND MEASUREMENT TECHNIQUES 


A. PROCEDURE 


HE actual procedure used in the 
examination of each subject was 
somewhat as follows: 

1. Each individual subject was first 
given the verbal section of the Wechsler- 
Bellevue to determine if the intelligence 
level was within the average range. 

2. A Rorschach record was then taken. 

3. After the subject had had a brief 
rest period, the battery of rigidity 
measures was administered (in the same 
order as they are described below). 

4. Finally, the subject was asked to 
fill out the Guilford STDCR. 

The total time of examination ranged 


from three hours to seven hours—with ' 


a mode of about three and one-half 
hours. In all instances where subjects 
seemed fatigued extra rest periods were 
introduced. The longer examinations 
were always spaced over a two or three 
day period. 


B. MEASUREMENT TECHNIQUES 


The tasks finally included in the 
rigidity battery were selected to repre- 
sent a scatter of situations ranging from 
those in which the subject deals only 
with very simple stimuli and a limited 
number of alternatives, to those in which 
the subject deals with very complex 
stimuli and multiple alternatives. Fur- 
thermore, they were selected to include 
tasks varying widely in the degree to 
which they might "involve" the subject 
in an emotional sense. Thus, there are 
two parameters running through the 
tasks: А complexity parameter and an 
emotional involvement parameter. It is 
important to note that a large number 
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of overlapping tasks are employed so 
that possible failure of any single task 
to function as a rigidity measure in any 
individual case will be compensated for 
by the presence of other overlapping 
tasks, It should be added that a pre- 
liminary test of nineteen tasks was 
undertaken with ten subjects. Those 
were retained in the final battery that 
represented the easiest to administer and 
seemed to provide the best measure of 
rigidity. 

Each task in the rigidity battery will 
now be described. 


1, Vigotsky Blocks 


The Vigotsky blocks consist of 22 wooden ob- 
jects of various forms, colors, and shapes. (16, 
рр. 462-484). Each subject is instructed: “Show 
me all of the different ways these blocks can 
sensibly be divided into groups. Show me one 
way; then another—until you've shown me all 
of the ways you can think of.” Scoring is based 
on the number of such groupings he is able to 
make, The specific details on scoring may be 
found in Appendix I. 


2. Colors 


Each subject is asked to express his liking for 
each of eighteen colored ribbons of varied hues 
and patterns and about four inches in length. 
As each ribbon is presented separately to the 
subject, he is asked: “Do you find the color 
of this ribbon to be pleasing or not pleasing? 
Do you like it or not like it?” The number of 
ribbons reacted to favorably constitutes the final 
score. It is here assumed that limitation in num- 
ber of positive or favorable reactions is a meas- 
ure of at least one aspect of rigidity. 


3. Trait Judgment 


The subject is presented successively with 
several series of pictures of persons—each series 
consisting of ten pictures, In the first series the 
subject is told: "Select all those pictured in- 
dividuals that appear to be friendly." In the 
second he is instructed to select all those that 
appear to be ambitious; in the third all those 
that appear to have a sense of humor; in the 
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fourth all those that appear to be happy; and 
in the fifth all those that appe® to be intelli- 
gent. It is assumed that altEough pictures actu- 
ally afford some valid criteria for making the 
above judgments, that the restrictiveness and 
criticalness of judgment for each series is to a 
significant degree a function of rigidity trends. 
The number of pictures accepted as belonging 
to cach trait category constitutes the final score. 


4. Comparison of Self with Pictured 
Persons 


Series r.—The subject is shown successively a 
Series of thirty pictures of different women. The 
Subject is then told: "As I show you cach pic- 
ture tell me whether you think the person 
shown in the picture is a person like yourself— 
whether she has the same kind of personality 
as yourself.” Additional instructions are given 
about disregarding any similarities in physical 
appearance between the pictured person and 
the self and concentrating on a general impres- 
sion of the personality of the person represented. 
The number of pictures that the subject is able 
to accept or perceive like himself constitutes 
the total score. 

Series 2.—The subject is shown successively 
fourteen distinctly unpleasant pictures of women 
taken from the Szondi (24) series and is asked 
e decide in each case whether the person shown 
s: 

(а) "Quite a bit" like the self; 

(b) "A little bit" like the self; 

(c) "Not at all" like the self; or 
, (d) Whether it is impossible to reach a deci- 
Sion about the picture in question. 

These instructions were given orally; and re- 
peated as many times as necessary until it was 
Clear that the subject comprehended them. 

Scoring of the judgments is based on a 
Weighting system that credits +3 for answer 
4"; +2 for answer "b"; +1 for answer ug"; 
and o for answer “c.” It is assumed that the 
Breater the total score, the less the rigidity that 
1s manifested. 


5. Thematic Apperception Test 


4. The subject is shown successively each of 
ten ТАТ cards, He is instructed; "Tell me all 
9f the different things this picture might sensi- 
bly represent. What are the different things 
that might be happening in the picture?" 
Particular care is taken to motivate the subject, 
to make him feel that he should give as many 
interpretations as possible. The total score is 
based on the number of interpretations offered 
—but qualified in terms of plot quality and also 
the degree of difference of those interpretations 


given for the same TAT card. Details of the 
scoring system may be found in Appendix II. 

b. The subject is shown the same ten TAT 
cards. In each instance he is asked to select from 
a printed list of alternatives (six per TAT card) 
all of the various kinds of situations that the 
card might sensibly represent. The total score 
is based on the number of alternatives selected 


by the subject. 


6. Annoyances 


a. Each subject is told: "I am going to read 
a list of things to you that people sometimes 
find annoying. As I read each thing to you, tell 
me whether you find it annoying." A list of 
twenty-eight annoyances is read to the subject 
one at a time. The supposition underlying this 
procedure is that the individual who is least 
open to grasping alternative ways of interpreting 
or dealing with situations will tend to react 
to an unusually large number of aspects of 
persons and things in an unaccepting (or an- 
noyed) fashion. Clearly, rigidity is here not so 
directly definable in the form of a simple “num- 
ber of selected alternatives." Rather, there are 
emore intervening assumptions—the principal of 
these being that personality rigidity will reflect 
itself to at least some degree in limitations of 
outlook and a greater tendency to be rejecting 
of that which is "different" or new. The total 
score is the sum of indicated annoyances, and 
the larger the score the greater the degree of 
rigidity. 

b. Each subject is also asked to select from 
the same list all those which he thinks "most 
people" find annoying. When the task is per- 
formed with this frame of reference, the total 
score is based on the number of annoyances 
ascribed to “most people.” The same logic lies 
behind this procedure as indicated for "self an- 
noyances.” However, interest is focused on the 
rigidity shown when а new, forced frame of 


reference is involved. 


7. Range of Interests 


a. Each subject is instructed to select from a 
list of thirty-seven different activities (e.g. 
swimming, dancing, etc.) which are read to him 
one at a time all those he feels he enjoys doing 
very much. He is told: “I am going to rcad a 
list of activities to you. Tell me which of them 
you enjoy doing very m uch.” The same rationale 
lies behind this procedure as for the use of 
annoyances as a measure of rigidity. The total 
score consists of the number of indicated in- 
terests. The greater the number of interests the 
smaller is the degree of rigidity. 

2. Each subject is also asked to select from 
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the same list all those he feels "most people" 
enjoy doing very much. When the task is per- 
formed with this frame of reference, the total 
score is equal to the number of interests ascribed 
to "most people." 


8. Rosenzweig Picture Frustration 
Test (19) 


This test contains a series of cartoon situations 
in which one person says or does something to 
frustrate another person. The subject usually 
writes in a designated box what he thinks the 
frustrated person would зау in response to the 
frustration. Fourteen of these cartoon situations 
are used. The subject in each instance selects 
from a list of five alternative printed responses 
those that he thinks he himself might make 
under the circumstances. The instructions are: 
"In each case tell me all of the things in the 
list that you think you might say if you were 
in that situation and had to answer the other 
person." In each printed list the responses are 
derived from norm lists published by Rosen- 
zweig. 

It is clear from Rosenzweig's data that a 
wide range of responses is possible in each. 
cartoon situation, and so to а large extent rc- 
striction as to choice of likely responses is as- 
sumed to be a rigidity function. The final score 


is equal to the number of alternatives selected 
by the subject. 


9. Blots , 


The subject is shown a series of twenty-one 
ink blots taken from the Terman M-F (25). As 
he views each blot he is asked to select from a 
printed sheet containing a listing of possible 
things the blot might represent (five alternatives 
per blot) all of the things that he thinks the 
blot could sensibly be said to represent. The in- 
structions are: “In each case pick out from the 
list that goes with the ink blot all of those 
things that you think it 
sent or be." Since the alte 


are all rather logical pos: 
ОЁ choices acce 
the final score. 


could sensibly repre- 
natives for each blot 
sibilities, the number 
Pted by the subject constitutes 


10. Hand. Steadiness Aspiration Task 


The apparatus used here is a modification of 
that described by Dunlap (9). It consists of a 
Series of 11 equally spaced holes of decreasing 
diameters drilled near the periphery of a circ- 


ular brass plate which can be rotated. The brass 
plate is mounttd on a stand and is at about 
"chest level" for a sıbject in a sitting position. 
A stylus about 15 inches in length is fixed in 
front of the plate so that its tip is inserted in 
the third smallest hole. This stylus is on a pivot 
which allows it to move freely in any direction. 
The subject holds the stylus with his favored 
hand in any way he finds "most comfortable." 
The apparatus appears to be connected to a 
counter—but actually is not. Consequently, scores 
can be reported to a subject arbitrarily as the 
experimenter desires. The instructions to the 
subject are: “This is a test to see how steady 
you can hold your hand and also to find out 
how well you can estimate your own ability to 
do things. With your eyes closed try to hold 
this rod in the middle of the hole as steady as 
you can. At the end of each trial I will tell you 
how many times you actually touched the sides 
of the hole. Then, you will try to estimate in 
advance the number of times you will touch 
it on the next trial." He is then given a practice 
trial for thirty seconds with his eyes closed. 
After the practice trial, ten regular trials are 
run. Before each trial the subject estimates what 
he expects to achieve; and at the end of the 
trial an arbitrary score, and yet one reasonable 
in terms of the possible achievement range, is 
reported to him. Following the first trial and 
for three further successive trials the subject 
is permitted to improve his performance pro- 
gressively in terms of a predetermined pattern. 
Failure to raise or lower aspiration estimates 
is taken as the scoring basis. The reasoning 
underlying the use of this task is that on each 
trial the subject has the choice of a theoretically 
wide range of estimates clustering somewhere 
near the “real score” reported by the experi- 
menter. Consequently, апу resistance he shows 
to the acceptance of the full range of possibili- 
ties and any predilection he shows for "sticking 
to" his own previous estimates serve as an index 
of his inability to utilize a range of alternatives 
in his reactions to situations (especially those 
Of a stress character). Three scores are derived 
from this task: (1) One indicating the amount 
of resistance to change when progressive improve- 
ment 1$ reported to the subject; (2) One indicat- 
ing the amount of resistance to changes when, 
kill is reported; and (3) 
ree of difference between 


р pendix III for details of 
the scoring procedure. 
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o CHAPTER IV 


ANALYSIS OF DATA 


A. VALIDITY CRITERIA 


1. Rorschach 


r USING the Rorschach test as а 
validity criterion against which to 
compare results obtained from the 
rigidity battery, it was decided to work 
out three separate measures: 

a. A measure of personality rigidity. 

b. A measure of degree of total malad- 
justment. 

c. A measure of degree of pathological 
looseness or disorganization. 

There are different ways in which one 
can approach the problem of evaluating 
Rorschach material. When using the 
Rorschach clinically, one usually at- 
tempts to grasp the various numerical 


scores obtained, in terms of total broad. 


patterns or sub-patterns. Actually, the 
Rorschach literature contains abundant 
evidence that this is the most fruitful 
mode of analysis. However, since broad 
patterns are extremely difficult to express 
quantitatively, most controlled studies 
involving the Rorschach test have tended 
to rely on correlations of single scores 
(out of context of the total pattern) with 
other variables. Such analysis of isolated 
Scores has generally been very disap- 
pointing and for this reason an attempt 
has been made here to define the 
Rorschach measures so that they retain 
some of the quality of pattern evalua- 
tion. This has been done by working 
out a scheme whereby individual scores 
are given different weights in terms of 
their quantitative relation to other 
scores, 

The above procedure will become clearer as 


the specific scoring systems for evaluating rigid- 
ity maladjustment and pathological looseness 


are described in detail. The deficiencies of such 
scoring systems must of course be kept in mind 
constantly. At most, they are only rough tech- 
niques of evaluation. They only touch on the 
complexity of analysis that goes into a careful 
qualitative consideration of a Rorschach record. 
Many complicated questions regarding relations 
of Rorschach factors would have to be worked 
out carefully in any project directed at Rorschach 
quantification per se; but these questions have 
necessarily received only a moderate degree of 
attention here. However, as will later be in- 
dicated, all of the scoring systems seem to be 
sufficiently differentiating to have possibilities 
as validity criteria. 


a. The Rorschach measure of per- 
sonality rigidity is based on a variety of 
Rorschach signs which clinically have 
been found to characterize persons who 

®are habitually constricted and who find 
it necessary to deal with the environment 
with an unusual amount of guarded 
caution. Quite literally, these signs are 
derived from indicators of the degree to 
which individuals limit the range and 
character of their responses to the 
Rorschach ink blots. Thus, limitation of 
number of responses; limitation in the 
use of color in perceptions; restriction 
of percepts to one class of objects (e.g. 
animals); and very delayed reaction 
times are illustrative of what is con- 
sidered rigid or restricted behavior in 
the Rorschach test. To construct a 
measure which would objectively convey 
the amount of Rorschach rigidity shown, 
a variety of the above described 
Rorschach signs were defined in terms of 
various scores (4) and given a series of 
specific weights. The particular rigidity 
signs selected and the weights assigned 
them were determined in an arbitrary 
manner based on general clinical experi- 
ence. It is not possible to demonstrate 
that the signs or weights selected are the 
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truest measure of personality rigidity in 
the Rorschach sense. They were chosen 
to express what was qualitatively sug- 
gested in the Rorschach data obtained. 
It is likely that better weighting systems 
could be worked out; the one presented 
here is merely the best that could be de- 
veloped under the conditions of this 
project. The value of this procedure is 
that it gives an independent measure of 
rigidity based on an instrument which 
has proved clinically useful. Thus, some 
frame of reference for evaluating the 
rigidity battery results is provided. The 
details of the scoring system may be 
found in Appendix IV. 

The differences in Rorschach rigidity 
among the clinical groups are shown in 
Table 2. It may be noted that the nor- 
mal group is significantly less rigid than 


either the hysteric or paranoid groups. 


TABLE 2 


E DISTRIBUTION OF RORSCHACH 
IGIDITY SCORES IN THE THREE EXPERIMENTAL 
GROUPS, AND DIFFERENCES IN RORSCHACH 
RIGIDITY AMONG THESE Groups 


CONSTANTS OF TH! 
К 


Normal | Hysteric | Paranoid 
Mean 24.9 44-3 44.1 
СА 11.8 17.2 13.6 
ст am 3-9 3.1 

| Normals | Normals Hysterics 

vs vs vs 
Hysterics | Paranoids | Paranoids 
Mean 

Difference 19.4 19.2 .2 
oD 4.7 4.1 5-0 
С.В. 4.1 4.7 VI 


The fact that no Significant difference 
exists between the gross scores of the 
paranoids and conversion hysterics is 
consistent with results recently obtained 
by Angyal (з). 

b. The Rorschach maladjustment 
Score is also derived in terms of pattern 
criteria, A range of Rorschach signs 


which are clinically recognized as indi- 
cating various degrees of personal malad- 
justment were selected. Each sign was 
given a weight in terms of its rclation 
to other factors and its assumed im- 
portance for diagnosing maladjustment. 
Details of the scoring may be found in 
Appendix V. The manner in which the 
Rorschach maladjustment score differ- 
entiates the clinical groups is shown іп 
Table 3. 

The normal group is very well differ- 
entiated from both the hysterics and 
paranoids, However, it is clear that the 
hysterics and paranoids are less well 
differentiated, although still significantly 
in a statistical sense. Aside from defects 
in the scoring system used, some of the 


overlap may be attributed to the follow- 
ing factors: 


1. Several of the hysterics 
adjusted—perh 
chosis. 


are extremely mal- 
aps bordering on incipient psy- 


2. À number of the paranoids had been hos- 
pitalized for acute episodes which had occurred 
some time previous to the time of study and so 


might well have attained various stages of re- 
covery. 


„3. It is generally recognized that paranoid 
disorders are difficult to detect on the Rorschach 


test—even when careful qualitative analysis is 
utilized, 


TABLE 3 
Constants or THE DISTRIBUTION or RORSCHACH 
MALADJUSTMENT Sc. 


ORES IN THE THREE EXPERI- 
MENTAL GROUPS, AND DIFFERENCES IN ROR- 


SCHACH MALADJUSTMENT AMONG THESE GROUPS 


Normal | Hysteric Paranoid 
Мп 36.9 59.7 85.5 
T II.0 19.1 32.4 
m 2.5 4.3 7.4 
Normals Normals Hysterics 
vs vs vs 
Hysterics Paranoids Paranoids 
Mean 
Difference 22.8 48.6 25.8 
тр ax 7:7 735 
CR. 8.1 6.3 3.4 
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In general, then, it would appear that 
the’ Rorschach maladjgstment score is 
able to detect personality maladjustment 
with at least moderate efficiency and 
that one may feasibly use it as a “mal- 
adjustment criterion" for interpreting 
the results of the rigidity battery. 

c. The “looseness” or pathology score 
is a separate compilation of all penalty 
weights given in the maladjustment scor- 
ing which are judged to indicate loose- 
ness or disorganization in the personality 
structure. It seemed desirable to have a 
separate index which would point up 
more serious pathology, such as ре- 
culiarities in thinking logic or unusual 
perseverative interest in sexual topics. 
This score is used only for a better 
understanding of the maladjustment 
score, Details relative to the penalty 
weights which go into this score may be 
found in Appendix VI. 

An attempt was made to derive a num- 
ber of other Rorschach scores (e.g. de- 
gree of general hostility and reaction to 
stress), Because of their limited range of 
variation and their general lack of 
promise they were not given further 
consideration. 


2. Guilford's STDCR 


This inventory was not used with the 
intention of actually measuring specific 
real personality factors such as S, T, D, 
G, R that the device purports to analyze 
and then comparing them with rigidity 
battery results. The main purpose in 
utilizing the STDCR was to have a uni- 
form means of analyzing a subject's way 
of describing himself. That is, all of the 
questions in the STDCR inquire about 
some behavioral or subjective aspect of 
the subject; and the way he answers 
these questions constitutes a form of self 
description. In answering the various 
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questions he is implicitly saying such 
things as: "I feel happy (or sad)" “I 
have many (or few) friends." "I am calm 
(or excitable,” etc. It is recognized, of 
course, that in some instances a subject 
may have so much insight into himself 
that his self descriptions are actually 
valid indicators of his real personality. 
However, the fact of these descriptions 
being self descriptions remains. 

With this point in mind, the highness 
or lowness of the total of the five factor 
scores is the chief STDCR measures 
used. A high score means that the subject 
describes himself in favorable or socially 
"good" terms. It means that he refers to 
himself as happy, well adjusted, satisfied, 
etc. A low score indicates that the sub- 
ject describes himself in unfavorable 

„poor adjustment" terms. No assump- 
tions regarding the motivation for these 
different kinds of self descriptions are . 
made. The STDCR scores will be treated 
simply as empirical indicators of favora- 
ble or unfavorable self description. 

It was noted that an additional point 
to be taken into account when interpret- 
ing the STDCR total as a mode of "self 
description" was the amount of variation 
in the. STDCR scores. That is, a high 
total accompanied by moderately high 
variation seemed to have a different sig- 
nificance than a high total accompanied 
by low variation. For this reason a varia- 
lion score for each series of STDCR 
scores was worked out. If the five 
STDCR scores are considered to be 
ranked in order from highest to lowest 
("favorable" self description to “un- 

favorable” self description), the variation 

score is equal to the difference of the 
fifth score from the fourth score plus 
the difference of the third score from 
the second score plus twice the difference 
of the fifth score from the first score. 


12 SEYMOUR. FISHER 


This specific way of computing variation 
is an attempt to maximize those aspects 
of the variation which seemed most im- 
portant (in terms of qualitative inspec- 
tion) for understanding the highness or 
lowness of the STDCR totals. Further 
discussion. of the variation score will 
occur at a later point. 

That the STDCR total in this study is 
largely not a measure of real personality 
factors but is at least to some degree only 
a subjective kind of self description is 
implied by the results in Table 4. Thus, 
as can be seen from Table 4, the para- 
noid schizophrenics do not differ signifi- 


TABLE 4 


CONSTANTS OF THE DISTRIBUTION oF STDCR 

TOTAL SCORES IN THE THREE EXPERIMENTAL 

GROUPS, AND DIFFERENCES IN STDCR TOTAL 
Scores AMONG THESE GROUPS 


Constants | Normal Hysteric | Paranoid 
Mean* 32.1 20.9 28.2 
с 5.1 7.2 8.1 
Om 1.2 1.6 1.8 
Normals | Normals Hysterics 
Constants vs vs vs 
Hysterics | Paranoids Paranoids 
Mean 
Difference II.2 3.9 7.3 
тр 2.0 2.2 2.4 
С.К. 5.6 1.8 3.0 


* The higher the mean, the 


y ger. 1 "better" the self 
description implied. 


cantly from the norma] subjects. This is 
true also when the fa 


ctor scores are con- 
sidered 


individually. Furthermore, al- 
though the paranoids are clinically and 
in terms of test criteria more disturbed 
than the conversion hysterics, their 
STDCR scores would misleadingly imply 
that their personality traits and reactions 
are more appropriate and adjustive than 
those of the hysterics. Further evidence 
in a similar vein Will be presented in 
more precise terms at а later point. 


3. Glinical Diagnosis 

It should be briefly noted that when 
this project was first undertaken con- 
version hysterics and paranoids were in- 
cluded on the widely held assumption 
that the hysterics would represent at one 
extreme the non-rigid type of personality 
and the paranoids at the other extreme 
a very rigid type of personality. It was 
thus that a clinical validity criterion 
for judging the rigidity battery would 
have been provided. However, the ob- 
tained results (verified by another study, 
3 PP: 133-135) indicate that conversion 

ysterics and. paranoids lie at the same 
extreme of rigidity, Consequently, 
validation comparisons are restricted 
mainly to contrasts between the normals 
on one hand and the conversion hys- 
terics and paranoids on the other hand. 


B. PRELIMINARY ANALYSIS oF INDIVIDUAL 
TASKS IN THE КісрІтү BATTERY 


This first phase of analysis is intended 
to describe very briefly some of the 
characteristics of the individual tasks in 
the rigidity battery: to make clear in a 
general fashion how the individual tasks 
are related to a variety of validity 
criteria and to point out the limitations 
of their independent use. 

It is quite clear from the data ob- 
tained that no single rigidity measure in 

is very useful in terms 
i ficiency, Most of the 
comparisons with 
h there is a certain 
in relationship 


ut the results. 
Thus, (as shown in Та} 
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But at the same time in twenty-three out 
of thirty instances in which normal 
means are compared with either hysteric 
or paranoid means the normals are indi- 
cated to be less rigid; and similarly of 
the thirty critical ratios derived from 
these comparisons thirteen are 1.8 or 
higher. Although eleven out of fifteen 
mean comparisons favor the hysterics as 
less rigid than the paranoids, these 
differences are so small and unreliable 
that no real conclusions on the point can 
be drawn. 

When the correlations of single task 
rigidity scores with Rorschach rigidity 
scores are computed (see Table 6) results 
are similar to those obtained from the 
comparison of the means. That is, very 
few significant correlations appear, al- 
though there is a certain trend of rela- 
tionship throughout. Thus, no single 
Score correlates well with Rorschach 
rigidity in every group; and only two 
tasks (self interests and spontaneous 
TAT interpretations) correlate 40 or 
higher for the three groups combined. 
However, five out of eleven correlations 
for the total group are 30 or higher. 
Sixteen of the individual group correla- 
tions are „го or higher and seven of these 


correlations are .40 or higher. Here too, 
the fact of low relationship is outstand- 
ing. 

A general comparison of single task 
Scores with Rorschach maladjustment 
Scores further emphasizes the limitations 
of these measures of simple tasks when 
used independently. As can be seen, no 
given task (Table 7) correlates signifi- 
cantly with Rorschach maladjustment in 
all of the diagnostic groups. One is 
Struck by the peculiar range of coeffi- 
cients. Rigidity on some tasks correlates 
positively with degree of maladjustment 
and in other instances rigidity correlates 
negatively with maladjustment. Indeed, 
when one observes, for example: (a) that 
flexibility in color preference has a —.66 
relation to Rorschach maladjustment 
in the paranoid group but a -L.16 re- 
lationship in the normal group; (b) or 
that low rigidity in terms of self interests 
has a —.55 relation to Rorschach mal- 
adjustment in the hysteric group, but a 
dera relationship in the paranoid group 
—it may be assumed that the relationship 
of rigidity to maladjustment is probably 
a complex one that will not reveal itself 
in these single isolated measures. Some 
of the factors underlying the range of 


TABLE 6 
CORRELATIONS* ОЕ INDIVIDUAL RIGIDITY ScorEs AND RORSCHACH RIGIDITY SCORES 
Variables Noa Es Paranoids Total group 

Rho Rho Rho Rho 

Colors — t — 
Traits Judgment = ui ж E — .06 —.19 
Pictures Like Self 4-.06 AR +.28 +.16 
Self Annoyances —.20 сй 3 —.10 +.17 
Other Annoyances —.19 5 59 y —.0$ 4.12 
Self Interests E.4 queo XE — og 
Other Interests к {> Eg +.os +.42 
Independent TAT TEC dme SE PE 
Blob def deis Es +.49 
А -16 +.s8 ia T 34 
Aspiration (F) 4:46 as se + р +.34 
Ё Че SO 
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TABLE 7 
CORRELATIONS or IxprfipuaL RIGIDITY TASKS WITH RORSCHACH MALADJUSTMENT 
р] 
Normals Hysterics Paranoids | Total Group 
Variable: 

Kd Rho Rho Rho Rho 
Color +.16 —.20 —.66 +.16 
Trait Judgment +.51 +.02 +.20 +.13 
Pictures Like Self (usual series) + .60 +.13 +.38 —.10 
Annoyances (S) +.39 —.54 + .03 + 04 
Annoyances (О) +.52 —.07 +.12 +.03 
Interests (S) —.13 —.88 +.25 —.19 
Interests (O) =.17 —.12 —.12 —.o2 
Independent TAT —.21 +.12 +.42 —.28 
Choice TAT —.17 —.20 +.38 —.19 
P-F — .34 34 +.20 —.30 
Blots —.24 — .o8 +.34 —.23 
Aspiration (F) —.49 +.о2 —.24 —.32 


correlations here obtained will become 
clearer at a later point when more com- 
plex analyses of the data are described. 
Altogether, then, one may conclude 
that there is little to be gained by at- 
tempting to measure ог understand 
rigidity and its related phenomena 
purely in terms of each separate task 
comprising the rigidity battery. 


C. ANALYSIS оғ TOTAL RIGIDITY SCORE 


The total rigidity score is equal to the 
sum of an individual's ranks for sixteen 
subtest scores. This general measure 
proves to be a more satisfactory one than 
any of the single task measures. It has 
also proved itself more effective than 
various combinations of individual 
tasks that were attempted. Its differ- 
entiations and correlations relative to 
validity criteria are at a more significant 
level. Before preceding to an analysis of 
these relations with validity criteria it 
will be well to deal with the question 
of whether the single task scores show 
some kind of consistent relationship to 
the total score and to what degree they 
Overlap in their contribution to the total 
score, 

The single scores show a low to moder- 


ate relationship to the total score. Thus, 
for the normal group only six of the 
twelve correlations are .30 or higher and 
only three are'.40 or higher. Likewise, 
for the hysteric group seven of the 
*twelve correlations are .30 or higher and 
four .40 or higher. In the paranoid group 
one finds only four correlations that are 
.go or higher and only one correlation 
equals or exceeds .40. There are distinct 


TABLE 8* 


CORRELATIONS OF SINGLE RIGIDITY SCORES 
WITH TOTAL RIGIDITY 


Normal | Hysteric | Paranoid 

Variables — 
Rho Rho Rho 

Color T.27 —.26 — .o2 0H 

Traits + .03 +.24 +.34 ' 
Self (usual) 4.37 +.38 +.36 
Annoy. (S) +.21 +.13 —.31 
Interest (S) +.32 +.56 +.05 
Annoy. (O) +.19 —.22 —.05 
Interest (O) +.12 4.22 4.04 
ТАТ (ind.) +.53 +.48 +.25 
ТАТ (Choice) +.52 +.67 +.28 
P—F +.46 +.38 Ape 
Blots |. +.33 +.52 +.56 
Aspiration (F) +.23 +.30 Toa; 


a Four of the sixteen individual tasks are not 
included in this analysis because their restricted 
scatter made correlations rather meaningless. 

b [n each case where a total score was cor- 
related with a specific individual score, the in- 
dividual score had previously been subtracted 
from that total score. 
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differences in the degree to which the 
single tasks seem to measure whatever 
there is that is common to the total 
score. Thus, color preferences, self 
annoyances, annoyances of others, and 
interests of others tend with but a few 
exceptions to give very low coefficients. 
On the other hand, such tasks as inde- 
pendent TAT interpretation, choice 
ТАТ interpretation, and blot identifica- 
tion tend to range at a distinctly more 
significant level. However, it is clear that 
in general the single tasks do not meas- 
ure any simple common uni-dimensional 
factor in any real sense. The significance 
of this fact will be clarified and ex- 
panded at a later point, 

To determine how much overlap or 
duplication there was among the various 
individual tasks, scatter-diagrams were 
plotted in all those instances where in- 
spection of the data might lead one to 
anticipate significant relationships. 
Correlation coefficients were then de- 
termined for those scatters that suggested 
significant overlap. Although there are 
numerous instances in which specific 
tasks give moderately high correlations 
within the setting of a given diagnostic 
group and although there are several 
cases in which moderately high correla- 
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tions obtain when the three diagnostic 
groups are Combined, there is no in- 
stance in which the overlap between two 
tasks persistently appears in each of the 
diagnostic groups. Such data tend to im- 
ply that none of the tasks duplicate cach 
other to such an extent that it would 
be considered advisable to let one repre- 
sent another. А more specific indication 
of the manner in which the tasks do 
overlap is shown by Table 9. 

Most of these correlations are below 
-50; and only three of them are .бо or 
higher. (It will become clear at a later 
point that moderate to high overlap be- 
tween certain task scores is advantagcous 
in the measurement of rigidity tenden- 
cies.) 

Returning to the question of what 
kind of relationships obtain between 
total rigidity scores and criterion scores, 
we find that when groups are compared 
(see Table 10) in terms of their mean 
total scores the normals 
differentiated 


either the hy: 


are significantly 
as being less rigid than 
sterics or the paranoids. 
Once again there is no real difference 
between the hysterics and paranoids. In- 
deed, it is interesting to note how similar 
the means of the two groups are. Among 
the single rigidity tasks, it is true that 


TABLE 9 
Most SIGNIFICANT CORRELATIONS AMONG THE INDIVIDUAL RiGiDrTY Tasks 
Normal: /steri š 
Variables : Hysterics Paranoid 
Rho S.E. Rho S.E. Rho SE. 
Interest S--O +.56 -16 * 
Annoyances S--O бо m NS NS. 
Independent ТАТ +Choice TAT +.72 б NS. +.64 Er 
Traits--Self (Ordinary Series) +43 E NS NS. 
Blots--P—F +.48 18 NS N.S. 
TAT Choice+Interest (S) NS. d qose N.S. 
Blots+Interests (S) N.S. +. +18 NS, 
Blots+Traits NS +.49 -19 N.S. 
Blots--Aspiration (S) NS. N.S. +.48 ‚18 
N.S. +.49 ‚18 
* N.S. = Not significant. 
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CONSTANTS OF THE DISTRIBUTIONS оғ TOTAL RIGIDITY SCORES IN THE THREE GROUPS, 
AND DigFERENCES IN TOTAL SCORES AMONG THESE Groups ` . 


Constants N* H Р N+H N+P 'H+P 
M 418.0 534.0 517.0 
с 91.5 110.0 93.2 
от 20.8 2549 31.1 
D 116.0 99.0 17.0 
тр 33.0 29.3 33.0 
C.R. 3.5 3.4 E 


*N=Normal; H= Hysteric; P = Paranoid. 


Vigotsky sorting and spontancous КАЛ? 
interpretation differentiate the normals 
from the hysterics and paranoids at a 
more significant level than the total 
score; but the superiority of these tasks 
disappears when another criterion such 
as Rorschach rigidity is brought into the 
picture, 

The relation of the total rigidity score 
to Rorschach rigidity tends to be moder- 


TABLE 11 


CORRELATIONS or TOTAL RIGIDITY SCORE 
WITH RORSCHACH RIGIDITY 


Normal | Hysteric | Paranoid Total 


+. +. +.10 +.52 
S.E. Cn S.E. s) S.E. (.24) | S.E. (. 10) 


ately high. It ranges from .10 in the 
paranoid group to .73 in the hysteric 
group. For the three groups combined 
the coefficient is .52. These differences 
are probably at least in part due to the 
narrower range of scores in the normal 
and paranoid groups as contrasted with 
the hysteric group. But there are more 
important reasons for these differences 
which will be discussed at a later point. 
Altogether, the preceding data imply 
that total rigidity scores measure “деп- 
eral" differences in rigidity trends with 
only low to moderate success. 

Although only those persons who fell 
Within the IQ range of 95 to 110 were 


used as subjects, still IQ differences in 
the groups exist. One might ask whether 
the observed general rigidity trend 
differences are not basically due to varia- 
tions in intelligence. 

The correlations shown in tables 12 
and 13 indicate that intellectual differ- 


TABLE 12 


CORRELATIONS OF INTELLIGENCE WITH 
TorAL RIGIDITY SCORE 


Normal Hysteric | Paranoid Total 


+. —.2 —.52 —.$ї 
S.E. em S.E. (a) S.E. Os S.E. (.10) 


ences do contribute in a limited degree 
to the total rigidity score. However, as 
can be seen from the correlations of in- 
telligence with Rorschach rigidity, the 
contribution which intelligence makes to 
the total score seems in general to be 
different from that which the Rorschach 
rigidity scores and the total scores have 
in common. 

And if one assumes that Rorschach 
rigidity is basically some kind of index 


TABLE 13 


CORRELATIONS OF INTELLIGENCE WITH 
RORSCHACH RIGIDITY 


Normal | Hysteric | Paranoid Total 


+.or —.27 — .04 — 38 
S.E. (.24) | S.E. (.22) | S.E. (.24) | S.E. (.т2) 
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of personality rigidity, it would appear 
that the total rigidity score does get at 
a personality rigidity variable which is 
not measured by intelligence. The fact 
that intelligence does in some of the 
groups have a significant influence on 
general rigidity trends is not surprising 
When one considers the interdependence 
of intellectual and emotional factors; 
and remembers that the intellectual 
assets of an individual may aid him in 
solving very personal emotional prob- 
lems or benefiting from psychiatric 
therapy. Generally, the total. rigidity 
Score refers to rigidity trends that are 
fairly independent of intelligence; but it 
is likely that, if one wanted to compare 
individuals of widely differing intelli- 
Bence, separate norms for the different 
intellectual levels would be necessary. 

Some effort has been directed to 
demonstrating that the battery of rigidity 
tasks measures rather consistently some 
common variable that may be called 
“personality rigidity,” and the indica- 
tions are that it does measure such a 
variable to a certain degree. However, as 
One reviews the data and notes such 
facts as unusual shifts in correlation 
from group to group; inconsistent re- 
lationships; and the wide scatter of 
single task scores within many subjects’ 
total scores, two possibilities come to 
mind: 

а. One may hypothesize that the pres- 
ent technique for measuring personality 
rigidity has only a moderate efficiency 
and that there are many situ 
which it is very ineffective as а 
device. 

b. Or, one may postulate that to con- 
ceive personality rigidity as a measurable 
Suture is fione 01 Personality 
That is, one ma fia вар, 

; y assume that there are 


ations in 
measuring 


general tendencies toward rigidity or 
looseness in miost personality structures; 
but that these are only, tendencies and 
that possibly there are different levels 
of rigidity and perhaps even contra- 
dictory currents of rigidity in the same 
individual. Certainly the first hypothesis 
accounts for some of the discrepancies in 
results, but it will be shown in the next 
section that the second hypothesis seems 
most applicable. 


D. ANALYSIS IN TERMS ОЕ LEVELS OF 
RIGIDITY 


If one prepares profiles of the various 
task scores for each individual, one is 
struck by the fact of variation rather 
than of uniformity. It is the rule to find 
that most subjects show a wide range of 
differences in their Separate scores. The 
individual who is uniformly high or low 
is an exception. This js not at all strange 
when one considers the range of diffi- 
culty and emotional Stress represented 
among the tasks in the rigidity battery. 
Some tasks (e.g., color preference or trait 
judgment) simply call for an expression 
of opinion and permit the subject to 


of his decision, 
TAT interpret 


the second ty 
tion in mind 
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a rough arbitrary classification of the 
rigidity tasks into a series*of levels and 
to determine what kinds of relations ob- 
tained among these levels over the range 
of subjects. Finally, the task was seen as 
one of establishing the significance of 
these relationships of levels by using the 
validation criteria available. 

The rigidity tasks may be roughly 
classified into three levels—as defined by 
the criteria that follows: 
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defined. Furthermore, almost immedi- 
ately after such patterns were first ob- 
served it was noted that certain of them 
were associated with individuals who 
might be described as well adjusted (in 
terms of Rorschach and clinical criteria) 
and those personality structures ap- 
peared to be integrated in a balanced 
fashion; whereas other patterns were 
associated with persons who were ill and 


Level I 
The task represents a de- 
mand on the subject which 
he finds difficult to deal with 
because it arouses emotion or 
calls for a type of response 


Level II 
The task involves a moderate 
degree of difficulty, but is at 
the same time subject to con- 
siderable "private frame of re- 
ference" determination. 


Level Ш 
The task represents a de- 
mand to which the subject can 
respond without much diffi- 
culty or in very arbitrary un- 
challengeable terms dictated by 


which is not readily available 
to him in terms of his back- 
ground of experience 


In terms of these criteria the specific 
tasks may be considered to group them- 
selves in the following categories: 


Leuel I 


Level II 
. Self interests. 


his own private frames of re- 
ference. In other words, he may 
respond to such tasks about “аз 
he pleases.” 


с: > = 
disturbed. The pattern associated with 
the adjusted integrated person may be 
pictured somewhat in the fashion shown 


Leuel 11 
1. Color preferences. 


1. Aspiration estimates dur- 1 1 
ing failure. 2. Interests of others. 2. Selection from ordinary 
2. Independent TAT inter- g. Self annoyances. range of pictures of persons 
pretation. 4. Annoyances of others. like the self. 
3. Choice ТАТ interpreta- 5. Aspiration estimates dur- 3. Selection from “unusual 
tion. ing success. ч range of pictures of persons 
4. Blot identification. 6. Vigotsky sortings. / like self. , 
7. Responses to the P-F sit- 4 Judgment of traits from 
uations. pictures. 


The grouping of tasks into levels is as- 
sumed to hold true only “оп the average.” 
Thus, it is possible that for some subjects 
a level II task is actually a Level I task. 
However, the presence of several tasks 
at each level minimizes error of this 
kind, 

When the single rigidity scores of each 
Subject were viewed in terms of the 
above scheme, it was evident that there 
Were definite patterns of relationship 
among them which could be precisely 


at the top of the next page. 

The integrated personality is marked by 
lowest degree of rigidity at level I, with 
moderately increasing rigidity at levels 
II and III. It is also characterized by a 
definite tendency for single tasks at the 
same level to indicate a similar degree of 
rigidity. However, the pattern of the 
disturbed individual frequently departs 
radically from the "healthy" pattern. 
Greatest rigidity may be shown at level 
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CHARACTERISTICS OF THE RIGIDITY LEVELS IN “IDEAL PROFILE” 


Level I 
(A) Of the three levels this 
is the least rigid. 
(B) АП rigidity scores at this 
level cluster at a similar point 
of rigidity. 


of rigidity. 


I instead of level III; or tasks at the 
same level may give widely different 
rigidity scores. In order to test out the 
above suppositions in precise fashion it 
was first necessary to construct an "ideal 
healthy profile," in which the relation- 
ships of representative tasks from each 
level were quantitatively defined. Fur- 
thermore, it was necessary to work out a 
system ôf weighted penalties to score 
the deviations of each subjects profile 
from this “ideal profile.” There follows 
a summary of the scoring scheme that 
was evolved. 

As can be seen in the above summary, 
scoring weights are assigned so as to 
penalize all deviations from the “ideal 
profile" and to give positive credit for 
conformance to it. In sections A and B of 
the scoring summary credits are given in 
terms of the assumption that rigidity for 
level I tasks should be less than rigidity 
for level II and III tasks, Each level is 
represented by sample tasks rather than 
by all the tasks falling at that level. Rank 
differences between the level I and 
level II and III tasks in the optimum 
direction determine the positive weight- 
ing and rank differences contrary to the 
optimum direction determine the nega- 
tive weighting. In section C an evalua- 
tion is made of the degree to which level 
П rigidity is less than level ш rigidity, 
by comparing sample tasks from each 
of these levels. Finally, in section D a 
test is made of the extent to which 
sample tasks from the same rigidity level 
actually give similar rigidity scores, A 


Level II o 
(^) Occupies middle point 


(B) АП rigidity scores at this 
level cluster at a similar point. 


Level Ш 
(A) Highest rigidity. 
(B) АП rigidity scores cluster 
at similar point. 


fixed positive weight is assigned for each 
instance in which the subject responds 
with equivalent rigidity to tasks belong- 
ing to the same level. A range of nega- 
tive weights is used to penalize signifi- 
cant differences in rigidity among tasks 
of the same level, 

The final total score is the difference 
between the sum of the positive weights 
and the sum of the negative weights. 

The scores secured through the above 
procedure manifest some very interesting 
relationships to maladjustment, as de- 
fined by the Rorschach test. Within the 
normal group a very clear correlation 
obtains. The fact of clear correlation is 
also evident when the three separate 
groups are treated as one. However, both 
the hysteric group and the paranoid 
group give insignificant coefficients for 
the two variables. When these two 
groups were analysed to determine why 
they differed so markedly from the nor- 
mals it was discovered that in both 
instances the correlations were thrown 
off by several cases that had certain im- 
portant characteristics in common, It 
was found that these particularly deviant 
cases were all marked by the following: 

1. They were all am 


sick and disorganized о 


—as defined by Rorscha 
Rorsch: 


toms. 
2. They were all dey 


ong the most extremely 
Ë their respective groups 


ch maladjustment scores, 
ach looseness Scores, and clinical symp- 


a: А iants from the correla- 

n trends of their respective groups in the 
Sense that their “balance” rigidity scores im- 
plied unusually little disturbance—whereas their 
Rorschach Scores implied that they were very 
ill. On the basis of the above, a hypothesis was 


Е 
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TABLE 13A 
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SUMMARY OF Mfriop For SCORING RIGIDITY PATTERN DISTURBANCE 


سر 
Š 5 š " Positive 3 Negative‏ 
Optimum Relationship Ranks Credit Ranks Credit‏ 
A. r. Rigidity of aspiration (F) less than | If: If:‏ 
rigidity of color preference 1-2 ranks 1-2 ranks‏ 
Rigidity of aspiration (F) less than | less + тоге =;‏ .2 
rigidity of annoyances (S)‏ 
Rigidity of aspiration (F) less than | 3 ranks 3-4 ranks‏ .3 
rigidity of trait judgment less T2 more =å‏ 
Rigidity of independent TAT less‏ .4 
than rigidity of color preference 4-5 ranks 5 or more‏ 
Rigidity of independent TAT /ess | less +3 ranks -3‏ .5 
than rigidity of annoyances (S)‏ 
Rigidity of independent TAT less | б or more А‏ .6 
than rigidity of trait judgment ranks +4‏ 
B. 1. Rigidity of aspiration (F) less than | If: If:‏ 
rigidity of aspiration (S) т rank 1-2 ranks‏ 
less +1 тоге =x‏ 
ranks 3-4 ranks‏ 2 
less +2 more —2‏ 
ranks 5-7 ranks‏ 3 
less +3 more "33‏ 
ranks 8 or more‏ 4 
less +4 ranks —4‏ 
More than‏ 
ranks +s‏ 4 


UU LZ ass 


C. т, 


D. Following pairs of tasks should be at 
same rigidity level* : 
т. Aspiration (Е) and indep 


3 
4 


ta 


* That is, within two ranks of each | 
inal score is equal to sum of posi 


. Self 


Rigidity in selecting pictures like 
self (unusual series) more than ri- 
gidity relative to self interests 

апа 
Rigidity in responding to P-F 
situations 


ТАТ interpretation у 
Aspiration (Е) and choice ТАТ in- 


terpretation 


. Aspiration (Е) and blot identifica- 


tion 


. Independent TAT and choice TAT 


interpretation 


. Choice TAT interpretation and 


blot identification 
interests. and 
others 


. Acceptance of "ordinary" pictures 
like the self and "unusual" pic- 


tures like the self” 


Same weights as used 


in (А) above 


Same weights as in (B) above 


endent 


interests of 


+4 


For each rank dif- 


ference beyond 
two, negative 
weights are as- 


signed as in (B) 
above 


other. i a 
tive and negative credits. 


22 SEYMOUR FISHER 


TABLE r4 


CORRELATIONS OF "BALANCE RIGIDITY Scores” 
AND RORSCHACH MALADJUSTMENT SCORES 


Total 


Paranoid Group 


Normal | Hysteric 


8 + .06 +.19 +.42 
SE (567) S.E. (.24) | S.E. (.23) | SE. (.11) 


formulated that the balance rigidity score was 
not a measure of maladjustment, as such; but 
rather a measure of present degree of disturbance 
in an individual's rigidity defenses—that is, a 
measure of how "comfortable" or "disturbed" 
he was with the particular style of defense or 
adjustment he was now utilizing. This would 
perhaps explain the deviant cases—since there 
is some logic in assuming that individuals who 
manifest very primitive Symptoms or extreme 
disorganization have ceased to struggle with 
their difficulties and are no longer subject to 
the tensions associated with struggle. 


To test this viewpoint, it was decided 
to continue the use of the Rorschach 
maladjustment score as a measure of 
present disturbance in adjustment. How- 
ever, it was first necessary to isolate and 
eliminate objectively all cases from the 
hysteric and paranoid groups that might 
be considered to have "settled" in rather 
regressive fashion for a specified neurotic 
or psychotic adjustment, -and whose 
Rorschach maladjustment scores would 
consequently not represent an adequate 
index of disturbance in defenses. This 
was done by eliminating any case in 
which the rigidity balance score implied 
that the individual was among the 
twenty per cent of least disturbed. indi- 
viduals in his group, but whose 
Rorschach maladjustment score ог 
Rorschach looseness score (whichever 
Was applicable) indicated that he was 
among the twenty per cent of most mal- 
adjusted or disorganized in his group. 
By this procedure three hysterics and 
two paramoids were eliminated. 


It is interesting to note that the three hys- 
terics selected have identical Symptoms, viz., 


they are all subject to generalized tremors or 
convulsive-like seizures involving the entire body. 
But it is most surprising of all to find that they 
are the only hysterics to manifest such behavior 
as their dominant symptomatology, These symp- 
toms are obviously of a more regressive and 
primitive character than the headaches, pains, 
and slight paralyses characteristic of the other 
hysterics in the group. The two paranoids elimi- 
nated were clinically both marked by extreme 
looseness. Their Rorschach protocols are per- 
vaded by an extreme amount of anal and sexual 
content (e.g, "bloody vagina,” "penis," "hem- 
moroid pretrusions") which was expressed 
quickly and without hesitation. One of the 
paranoids Spontancously remarked that she was 
"hearing voices" and both Were passively com- 
pliant during examination—all of which is in 
definite contrast to the rescrved hostile behavior 
shown by most of the other subjects in the 
paranoid group. In general, one seems justified 
in concluding that the cases eliminated in terms 
of the contrasting extremeness of their "balance" 
and Rorschach scores show certain clinical uni- 
formities that confirm earlier stated assumptions 
about the level of their adjustment. 


When correlations between rigidity 
balance scores and Rorschach maladjust- 
ment scores are determined on these 
"clean" groups, the correlation for the 
hysteric group shifts from +-.06 to +.46 
and the correlation of the paranoid 
group shifts from +.19 to +.72. The 
correlation for the total group shifts 
from +.42 to +.60. That is, if the 
rigidity balance score? is conceived as a 
measure of present disturbance in de- 
fenses it correlates with suitably in- 
terpreted Rorschach maladjustment 
Scores quite highly. 

The diagnostic Possibilities of the 
rigidity balance score and its usefulness 
in understanding the dynamics of indi- 
vidual cases will be discussed at a later 
point. 

Altogether, the data 
section may be said to 
ing implications: 


Presented in this 
carry the follow- 


*It may be mentioned 
do rigidity balance Score: 
to intelligence scores. 


here that in no instance 
5 show any relationship 
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TABLE 1$ 


SuMMARY OF CORRELATIONS BETWEEN RIGIDITY 
BALANCE SCORES AND, REINTERPRETED 
RORSCHACH MALADJUSTMENT SCORES 


Total 


Paranoid Group 


Normal Hysteric 


Prag, Luxus ue m. 
SE. Cop) | SE, Con | SE. (13) | SE. C09) 


1. They imply that the concept of a 
general trait or general factor of per- 
sonality rigidity is a fictional one. Al- 
though there appear to be certain gen- 
eral personality rigidity trends, to 
conceive of them in any simple uni- 
dimensional sense or even in a moder- 
ately complex trait sense is an over- 
simplification-of the facts. 

2. They point to the probability that 
personality rigidity varies at different 
levels of the personality structure. 

3. They imply that “good” or healthy 
adjustment is usually marked by certain 
optimum relationships among the rigid- 
ity levels in the personality structure. 

4. They imply that factors which lead 
to a disturbance in present adjustment, 
which require a difficult shift in rigidity 
defenses, result in a breakdown of opti- 
mum relations among the various ri- 
Bidity levels. 

5. Lastly, they indicate that in extreme 
cases where conflict and disturbance 1n 
defenses give way to a markedly regres: 
Sive, settled neurotic adjustment or re- 
Bressive-like psychotic adjustment, the 
various rigidity levels tend to resume the 
Optimum relations characteristic of 
"non-disturbed" adjustment. 


E. ANALYSIS OF INDIVIDUAL RIGIDITY 
PATTERNS 


In this section an attempt will be 
made to analyse the various kinds of 
rigidity patterns that appear in each of 
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the groups and: to relate such patterns 
to degree of maladjustment, clinical 
symptomatology, and the kind of de- 
scription of himself the subject gives on 
the STDCR. 

One of the chief problems in analysing 
the rigidity patterns of a range of sub- 
jects is that the large number of rigidity 
scores which must be considered simul- 
taneously in each case are difficult to 
conceive as a unity. The underlying 
pattern is concealed by the fact of com- 
plexity. In order to deal with this con- 
fusing complexity a number profile 
system has been worked out which gives 
a simplified picture of rigidity patterns. 
This system will now be briefly de- 
scribed. Four variables are considered in 
the course of picturing a particular rig- 
idity pattern: 

1. The degree of present disturbance in the 
pattern—i.e., its deviation from the "ideal" pro- 


file previously described in terms of the rigidity 


balance score. 

2. Generalized trends toward rigidity—as in- 
dicated by the total rigidity score. 

3. Degree of rigidity at level I as indicated 
by a representative score from level I—viz., the 
independent TAT interpretation score. 3 

4. Indications of unusually low rigidity when 
dealing with "easy" ог "peripheral" rigidity 
tasks—as measured by the number of certain 
level II and III tasks for which unusually low 
rigidity scores are obtained (see section D above). 


These rigidity variables may be con- 
verted into numbers (as will be described 
below) and the rigidity pattern repre- 
sented by a numerical profile in which a 
particular position is assigned to each 
variable. Thus: (see top of next page). 
In this way, by acquainting oneself with 
the significance of each position in the 
profile, one may grasp the general pic- 
ture of an individual's rigidity pattern. 

The basis for assigning numerical 
values to the various positions in the 
numerical profile is indicated by the 


Position One 
(degree of disturbance) 


Position Two 
(general rigidity 
trends) 


criteria set forth in Table 15А. 

In general, then, if one reads from left 
to right in any rigidity numerical profile 
one may note degree of pattern dis- 
turbance, general rigidity trends, degree 
of rigidity under emotional stress, and 
indications of degree of unusual periph- 
eral flexibility. 

Table 16 shows the total range of 
rigidity patterns that one might theo- 
retically obtain in terms of the four rig- 
idity variables in the profile. It can be 
seen that the actually obtained patterns 
tend to scatter over the whole theoretical 
range. However, one finds instances (e.g., 
1120 through 1131) where there are 
sudden gaps in the distribution. One can 
only speculate about the meaning of 
such gaps. It is possible that certain ri- 
gidity pattern characteristics are psycho- 
logically mutually exclusive. 

The patterns for any particular diag- 
nostic group cluster moderately, but 
there is much overlap among the groups. 
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Position Three 
(level I rigidity) а 


Position Four 


(indicator of unusually 
' low peripheral rigid- 
ity) 


to follow, a rigidity profile does not have 
an absolute significance, Rather, it must 
be interpreted relative to other variables, 
Such as clinical status, symptomatology, 
and degree of maladjustment. When 
rigidity profiles are interpreted in a 
relative sense, their distributions within 
the diagnostic groups reveal themselves 
to be psychologically meaningful and 
separable into sub-groupings that over- 
lap only very moderately. An analysis of 
the rigidity pattern distributions in the 
various diagnostic groups follows. 


1. Patterns in the Normal Group 


а. Rorschach test. If each subject's rigidity 
pattern is placed on a continum relative to the 
rank position of the subject's Rorschach mal- 
adjustment score (as shown in Table 17), some 
very interesting cluster effects appear, That is, 
one perceives that the rigidity profiles within 
certain ranges on the Rorschach maladjustment 
continuum show similarities to each other and 
are different from the profiles found at other 
points on the continuum. By inspection one is 
able to isolate four rather distinct groupings or 
clusters of profiles. The first of these. groups 


As will be demonstrated in the analysis — consists of the four subjects with among the 


TABLE rsA 
POSSIBLE VALUES FOR THE POSITIONS IN THE RiGriDITY PATTERN PROFILE 


те Position Two " Position Fe 
Position One i igidi Position Three =e 
(Pattern Disturbance) ME iy (Level T Rigidity) (Wousgual Peripheral 
Degree Degree Degree Low Rigidit 
of Dis- | Ranks |Value| ‘of Rank |Value| o Ranks |Valu ірһегају 
turbance Rigidity Rigidity "|. Pegiperal — Ranks Value 
High I-22 3 Very high I-10 4 TAT high I-20 
Moderate | 23-40 2 High 11-20 3 Moderate 21-40 2 AE ondes (S) IX. 23 
Low 41-00 | т | Moderate | 21-45 | 2 | Low 41-60 | r nnoyances (О) | ты 
Low 46-60 1 Trait Judgment 1-15 
spirationb (S 
sete ап» (S) 
Unusual Series) 
(Ordinary Series) 


е scoring here is 
ranks higher, this repre: 
Here the scorin; 
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TABLE 16 


TOTAL PossipL® RANGE OF RIGIDITY PROFILE PATTERNS, AND THOSE 
ACTUALLY APPEARING IN THE THREE EXPERIMENTAL GROUPS 


Profile № H P Profile N H P Profile N H P 
S E r6 3 I 2 2 r í 6 £ rx Ө 
I s Ж... 3 I «n. m ru 2 $ r x qm 2 
IT x to 2 £ 4 o 4 120 
1 т š t # X š$ r @ 1 I 
Y x 3 10 2130 a £ 3: ° 
te d X W ¥ 3 4 323 I 
їч 7 GO I 2 2%) > 3 2 x O 
I 7 = x I 2. 2 F 3 32 Y Т 2 
1220 2 a 3 2 H I з š š Ó 
x 2 2 i а 2 & 1 2 3 22 Y 2 3 
таз © 1 2250 32509 
Iz Z i I 2 2 3 £ $0 3 x 4 
1 3 I о 2 g re $3 % 9 2 2 
r. $ rou И EI X $ 23 Le 1 
i š 2 06 a d 2:19 3.5 2 0 
+ S а: X 1 “ОЛ. m 3 § m I 
D $ x w z š š $ 1 33 3 © 
E S $ 1 ee 3 I I 2 $ š 3$ t 2 
t x r o Т o š 4 X G 
I Ww 7 4 4 Loi $ 4€ 
I 4 2 o X 2420 1 3.4 20 
row 2 x 24 2 I 1 I 3 4 2 z 
I453 © I 2 4.4 О I $ 499 
Y & 3 I 2 4 £ 1 1 2 3 á 3 X I 


a N =number of normal subjects manifesting the indicated pattern; H=number of hysterics; 


P=number of paranoids. ç 
> Тһе number “l” here indicates that there may be one or more scores that imply unusually low 


peripheral rigidity; that is to say, in this table distinction is made only between the Presence (indicated 
by “r”) or absence (indicated by “о”) of unusual peripheral flexibility. 
TABLE 17 


RIGIDITY PATTERNS OF NORMAL SUBJECTS RANKED ACCORDING TO RORSCHACH MALADJUSTMENT 
ScoRES AND SEPARATED INTO "CLUSTER" GROUPS 


Group I Group II Group III Group IV 
Subject 7 Subject 12 Subject 4 Subject 13 
5 of 1 © 1220 a t x Š 3 112 
Subject 19 Subject 20 Subject то Subject 17 
E m Ii Т p 2 2 0 2 3 X X $ 1 2 
Subject 14 Subject r Subject тт Subject 3 
n sce I4 20 iL 2 Ж a 4 * Ë 
Subject 8 Subject 2 Subject 15 Subject 5 
nu rro (not classified) a x © 2 з 2 2 3 


r ¥ z © 


Subject 16 
rn rds 


Subject 6 
2 2 2 3 


Subject 18 
2 2 2 I 


Subject 9 
I I I 2 
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lowest Rorschach maladjustment scores. With 
but one minor variation their common rigidity 
pattern takes this form: 1 1 1 o. In terms of 
what has previously been said about the gen- 
eral significance of each of the profile positions, 
one may conclude that this 1 1 1 o profile is 
marked by low emotional rigidity; an absence of 
any generalized rigidity trends; and freedom 
from disturbance in the interrelations of the 
rigidity levels. Furthermore, one is not able to 
detect any erratic manifestations of unusually 
low peripheral rigidity. The whole pattern con- 
veys a picture of optimum distribution of 
rigidity defenses within the personality struc- 
ture. Stated in more positive terms, the in- 
dividuals in this group seem to show the fol- 
lowing rigidity characteristics: 

1. They apparently seem not to feel threat- 
ened by most situations—as is indicated by the 
fact that they have no need for maintaining any 
Sort of persistent anticipatory guarding or 
rigidity set, 

2. They seem to maintain 


their composure 
under emotional stres: 


s very well. In unusually 
demanding situations they mobilize quickly; for 
instead of retreating behind rigidity defenses, 
they mobilize increased flexibility for the emer- 
gency. 

3. Although they are capable of freeing them- 
selves from rigidity tendencies to a high degree 
when the occasion requires, they tend to main- 
tain a certain basic minimum peripheral rigid- 
ity that is free of variations in the direction of 
extreme flexibility. - 

The second pattern group? consists of three 
individuals who are, in general, moderately 
higher in Rorschach maladjustment than the 
individuals of group one. Their basic pattern 
takes this general form: 1 2 2 0. It is obviously 
а more guarded and rigid group than group one. 
Level I rigidity is moderately high and general 
rigidity trends vary from moderate to extreme. 
However, there is a complete absence of dis- 
turbance in rigidity level relations and no signs 
of unusually low variations in peripheral 
rigidity, This group is especially interesting be- 
cause it consists of individuals wi 
a healthy adjustment, 


but in terms of much 
more rigid defenses thai 


а more guarded fashion, 
ntly maintain an opti- 
peripheral and emotional 


apable of increasing their 
و‎ 


*Subject 2 is omit 
Classifications beca 
similarities to an. 
9n the maladjus 


ted from any of the pattern 
use her pattern Shows few 


Y ОЁ the patterns in its vicinity 
tment continuum, 


ho are making - 


customary level of flexibility when more impor- 
tant emotional situations are in question. This 
balance and ability te increase flexibility under 
stress are apparently the most important factors 
in the group's healthy adjustment; for as will 
be shown, many more maladjusted individuals 
display greater freedom from rigidity—but not 
the same degree of balance in the relations 
among the rigidity levels of the personality 
structure. 

The eight subjects in group three show con- 
siderable variation in their pattern profiles. One 
finds a range from 1 1 1 1 to 2 3 1 1. How- 
ever, they are in general distinguished from 
those in the groups preceding them by their 
consistent manifestation of unusually low ре- 
ripheral rigidity in position IV and frequently 
also by signs of moderate rigidity balance dis- 
turbance in position I. Persons in this group 
seem to have some difficulty with their present 
defenses. Although they show little generalized 
anticipatory guarding and little need to retreat 
to a more rigid position when faced with stress 
demands, they do not manifest the same co- 
ordinated unity in rigidity organization that is 
characteristic of the healthier normals in the 
other groups. They assign a greater importance 


to peripheral demands and irregularly they 
meet such demands with ir Y 


Since the phenomenon of unusually high 
flexibility mobilization in reaction to peripheral 
demands i 


d ; etc. These 
ions may be thought of 


1. On the one hand, some of them (e.g. 
Over-responsive raising of aspirations during 
success) seem to represent an attempt to secure 
easy satisfaction and success. They represent, 
as it were, an о take a peripheral 
E ES periphera] Context and its 
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remains a fairly peripheral type of threat for 
most subjects become an impertant threat for 
others and these latter deny their vulnerability 
by showing an extreme overwillingness, for ex- 
ample, to admit the possibility that all sorts 
of unpleasant (even psychoticlooking) indi- 
viduals are like themselves. In general, periph- 
eral mobilizations of this sort may be thought 
of as representing an over-sensitivity to periph- 
eral threat. 

_ The peripheral mobilizations in group three 
included both types; but the usc of peripheral 
activities for easy satisfaction predominates. 

The fourth group is one that contains the 
four most maladjusted normal subjects, ac- 
cording to the Rooschach test. The profiles in 
the group all have the same value in position 
I, values above o in position IV, and approxi- 
mately similar values in positions II and III. 
Most clearly, they have in common the fact of 
their extremely high disturbance in rigidity 
balance relations and their frequent unusual 
peripheral flexibility mobilizations. Actually, they 
are only more extreme examples of group three. 
They have even greater difficulty in coordinating 
their defenses and a more pronounced concern 
with peripheral demands. It is important to 
note that the-two most maladjusted subjects 
in this group show, in addition, a tendency 
toward moderate generalized rigidity trends. 
These two individuals are guarded and unable 
to increase their flexibility to meet efficiently 
situations with important emotional significance. 

b. STDCR. An attempt was made to relate 
STDCR scores to the above described groupings 
—with the following results: 

Group one.—Al of the individuals in this 
group give STDCR scores which attain or 
moderately exceed in "goodness" general popu- 
lation means. Three of the four subjects attain 
Or exceed the middle rank position for the 
normal experimental group. Furthermore, the 
same three subjects give STDCR variation scores 
that are in the upper twenty per cent of the 
normal group. That is, the well integrated 
healthy individuals of group one describe them- 
selves in favorable or "good" terms. However, 
it is not an extremely "good" self description 
that they give and it is not a description of 
extreme uniformity. Their evaluations of fac- 
tors STDCR vary in terms of the particular 
factor concerned. А 

Group two.—All of the individuals in group 
two give STDCR scores that cluster either at 
the mean or slightly below the mean of general 
population norms. However, all of their STDCR 
sums are below the middle rank position of the 
normal experimental group; and with but опе 
exception their STDCR variation scores are 
distinctly above the middle rank position of 
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the normal experimental group. 

Clearly, these subjects tend to give a relatively 
more unfavorable description of themselves—al- 
though they show the same ability as the sub- 
jects in group one to vary their evaluations from 
factor to factor. The fact that this group at- 
tempts to give only a relatively restrained picture 
of itself ties in well with the fact that its rigidity 
pattern is definitely a guarded restrained one. 
Apparently, the general style of defense reflected 
in the rigidity pattern reflects itself also in the 
kind of description of himself the subject 
presents. 

Group Three.—The STDCR sums here scatter 
widely and follow no aparent uniformity. Five 
of the eight STDCR variation scores fall below 
the middle rank position and suggest that the 
individuals in this group tend to be less dis- 
criminating than those in groups one and two 
relative to the single STDCR factors. That is, 
they seem to describe themselves in more gen- 
eralized "halo" terms. 

Group Four.—The STDCR scores of this group 
are quite uniformly above the gencral popula- 
tion means. Three of the four subjects have 
STDCR sums which are distinctly above the 
middle rank position of the normal experimental 
group. Furthermore these same three subjects" 
STDCR variation scores are in the lowest twenty 
per cent of the normal group. Generally, it 
would apear that group four individuals give 
a very one-sided "good" descriptions of them- 
selves. They do not discriminate well relative 
to the single STDCR factors. It may be postulated 
that such biased "favorable" self descriptions 
perhaps represent an attempt to conceal or deny 
the disturbance which is so apparent in both the. 
rigidity pattern and in the Rorschach material. 

Although one can speak only in terms of 
trends, there are indications from the above 
that when STDCR scores are interpreted rela- 
tive to rigidity patterns they take on a functional 
meaning. That is, one can understand the kind 
of description of himself the subject gives as a 
reflection of specific relationships and signs of 
disturbance indicated in his rigidity pattern. 


2. Patterns in the Hysteric Group 


The hysteric group may also be analyzed 
into typical patterns. Although pattern clusters 
in the normal group were verified entirely by 
similarities in Rorschach maladjustment scores, 
an additional variable (viz. clinical symptoms) 
will be used to aid such verification in the 
hysteric group. Each identified pattern will be 
described in turn? 


*Subject 32 is not included in any of the 
groups to be described below because there are 
data which tend to contradict a diagnosis of 
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TABLE 18 


SDIVIDUAL STDCR Scores, RANKS or STDCR TOTAL Scores} AND RANKS OF STDCR 
pon S NATION SCORES FOR THE NORMAL SUBJECTS IN EACH RIGIDITY GROUP 


Rank of Sum Tko 
j T D € R of STDCR ariation 
EE ái Scores? Scores? 
Rigidity Pattern 
Group One 
Subject 7 6 то 9 5 8 5:0 & 
19 7 7 6 4 8 10.5 5.5 
S 14 $ 7 7 6 5 14.0 15.5 
$ 8 5 10 6 5 7 8.0 5.8 
Rigidity 
Group Two 
12 8 3 6 4 8 16.0 I.5 
S 20 6 2 5 4 5 19.0 5.5 
S I 5 3 5 4 6 18.0 9-5 
Rigidity 
Group Three 
S 4 6 9 7 5 5 10.5 8.0 
S zo 7 5 8 6 6 10.5 11.5 
S тт 6 2 3 2 7 20.0 5.5 
S ag 6 6 7 5 6 14.0 15.5 
S 16 1 9 9 7 8 2.5 15.5 
5 6 8 7 9° 7 6 б.о 11.5 
5 18 6 5 7 € 6 14.0 15.5 
S o 8 7 9 6 9 4.0 9.5 
Rigidity 
Group Four 
S: 12 8 то 9 8 8 1.0 19.8 
S m 9 7 9 8 7 2.5 15.3 
3 6 6 8 6 8 1 n 
5 6 4 9 9 4 10.5 3.0 


* Each factor is represented by a C-score based on adults. 


b Highest rank for highest sum. a 
° Highest rank for greatest variation. 


Group one.—This group consists of five sub- 
jects who are among those hysterics showing 
least maladjustment on the Rorschach test. Clini- 
cally they all have similar symptoms. Thus, the 
chief complaint of each subject revolves about 
localized pain or disturbance in some part of the 
body, (e.g. heart palpitation, pain in stomach, 
pain in back, pain in legs). In this group both 
positions II and III (with one exception) are 


conversion-hysteria, First of 
toms consist chiefl 
of insomnia and som 


3 or higher. Aside from case 33, position IV is 
consistently O. Also the values in position I do 
not exceed 2. The members of the group tend 
to be rigid at all levels, "Their personality struc- 
tures are protected by an claborate series of high 
rigidity barriers, They show little capacity for 
mobilizing flexibility to meet stress situations 
having emotional 

are constricted 
guarded rigidit 
However, in none of these cases does the degree 
of disturbance 


in pattern consistency (as shown 
by the balance у ( 


are also absent 
Group two. 
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TABLE 19 


е 
RIGIDITY PATTERNS oF Conversion HysrERICS RANKED ACCORDING TO RORSCHACH 
MALADJUSTMENT SCORES AND SEPARATED INTO "CLusTER" GROUPS 


Group I 


Group II 


Group III 


Subject 32 
(not classified) s 2 
т r: к Š 


Subject 26 
2 š з ° з 4 


Subject 3o 


Subject 4o 


Subject 22 


Subject 38 


Subject 31 
4 t з - NI 


Subject 37 

3 3 1 I t 2 
Subject 23 

(not classified) 


T o4 3 © i а 3 3 
з & 2 © 

Subject 33 Subject 28 Subject 21 

т ¢ 4 2 é 2 f * uo So 
Subject 29 Subject 35 Subject 34 

. 4 $ o £ $ $ € 2 2 mg € 
Subject 39 Subject 27 Subject 24 

1 2 з © 5 à mg "7 E G з Я 
Subject 25 Subject 36 

(not classified) 3 3 3 3 Ы 

з 5 + 32 " 

jects whose Rorschach maladjustment scores — tremors or convulsions involving the entire body. 


cluster from rank five to rank eleven (rank one 
indicating the most maladjusted). In contrast 
with group one, it embraces those who manifest 
more serious and generalized symptoms (e.g. 
Bencralized body pains, persisting vomiting, and 
dizzy spells. It includes what might be called 
the larger “middle group" of hysterical subjects. 
The patterns. typical of the group are marked 
by values of 2 or higher in position I, a value 
of g (with one exception) in position III, and 
а value of 1 or higher in position IV. These 
patterns are all similar in that they picture 
individuals who manifest strong rigidity trends, 
but whose rigidity defenses are in a state of 
high disturbance. The group two hysterics show 
about as much and sometimes less rigidity than 
the healthier hysterics; but they lack the same 
coordinated unity. Their rigidity is less consistent 
(in terms of balance). On the one hand, they 
are even less capable of mobilizing increased 
flexibility to face important demands; and on 
the other hand they are extremely more prone 
to bursts of non-functional peripheral flexibility. 
This is another indication of how much more 
important the organization factor is than mere 
highness or lowness of general rigidity trends. 
. Group three. Group three includes five sub- 
Jects: 

1. Four hysterics with extreme maladjustment 
Scores who show very severe clinical symptoms. 
Three of these subjects’ symptoms center about 


One subject has had periods of mutism; feels 
extremely nervous; and manifests tremor and 
numbness in one arm. 

2. Group three includes also one subject whose 
Rorschach score places her below the mean of 
the hysteric group in maladjustment. However, 
despite her moderately good showing on the 
Rorschach test, she has manifested such symp- 
toms as mutism, paralysis of lower limbs, brief 
convulsive spells, and periods of amnesia. The 
value of the rigidity pattern is particularly em- 
phasized in this instance; for it classifies this 
subject with the other hysterics showing most 
serious symptoms—even though the Rorschach 
fails to do so. The rigidity patterns typical of 
the group are most characterized by position 
ПІ values which do not fall below 2 and by 
position II values (with two exceptions) which 
also do not fall below 2. Clearly, these hysterics 
are much less rigid than those that have al- 
ready been described. They are less protectively 
guarded, less constricted by general rigidity bar- 
riers; and do not show the same tendency to 
retreat to rigidity when exposed to emotional 
stress demands. However, all of their rigidity 
patterns give evidence of considerable internal 
disturbance in terms of balance scores or in- 
appropriate peripheral flexibility. The fact that 
this group manifests the poorest maladjustment 
scores and the most extreme symptoms, taken 
in conjunction with the fact that it is the least 
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- TABLE 20 


VIDUAL STDCR Scores, RANKS or STDCR Тотал, SconES, AND Ranks or STDCR 
A SCORES FOR THE CONVERSION HYSTERIC SUBJECTS IN EfCH RIGIDITY Group 


Берер вих 
j T D © R> о R ariation 
шее S Scores? Scores? 
Rigidity Pattern 
Group One 
Subject 26 dos 2 o o 4 20.0 10.0 
3° 4 8 6 5 7 3-5 8.o 
S зз 3 5 5 5 4 8.5 14.5 
S 29 5 5 4 3 5 8.5 14.5 
S 39 3 7 2 3 I 15.5 2.5 
Rigidity 
Group Two 
Subject 40 5 10 7 6 6 1.0 б.о 
S 22 4 7 5 5 5 6.0 12.0 
38 3 4 2 2 5 15.5 12.0 
S 28 4 6 5 5 I 10.0 2.5 
S 35 4 6 4 3 2 12.5 8.0 
O 6 6 4 3 8 5.0 4.5 
ө 36 І 3 I 2 3 19.0 16.5 
Rigidity 
Group Three 
Subject 31 5 5 3 3 4 11.0 16.5 
37 5 3 3 3 5 12.5 18.0 
ar 3 2 2 2 4 17.0 19.0 
S 34 5 6 4 4 4 7.0 19.0 
S 24 T 5 3 3 o 18.0 4.5 


* Each factor is represented by а С-ѕсоге based оп adults. 


^ Highest rank for highest sum. | 
° Highest rank for greatest variation. 


rigid group of hysterics, leads one to postulate 
the following: As hysterics become sicker and 
more maladjusted, they are less able to meet 
their problems in terms of defenses based on 
evasive rigidity and these increasingly inadequate 
defenses begin to break down into a looser less 
organized form. 

The relations of STDCR scores to the hysteric 
rigidity pattern groups are not so clear as they 
were relative to the normal subjects. Most of 
the trends that occur are most apparent when 
groups one and two are contrasted with group 
three. The hysterics of groups one and two 
show no uniformity in the kinds of descriptions 
they give of themselves; but the hysterics of 


group three give a very unfavorable description 
of themselves. Four out of five 


in this gro 
obtain total STDCR scores below t Bb 


all tend to give 
below the mean 
thermore, while 


he middle rank . 


hysterics show Very low variation 
in their STDCR sc 
the group three h 
description of themselves, b i 

description in a uhi spon d ainsi 


does not show 


Broups one and 
edness, their re- 
Y when faced with prob- 
group three shows much 


treat to evasive rigidi 


ет is that the group one 
ot as ill and maladjusted 
terics; and so perhaps do 
у disturbed as the latter 
planation based purely on 
° ent would not provide an 
and landing ОЁ why certain зене bony one 
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patterns (but not the very best maladjustment 
scores) give descriptions of thefhselves which ex- 
ceed in "goodness" the general population means. 


3. Patterns in the Paranoid Group 


The paranoid rigidity patterns cluster along 
the Rorschach maladjustment continuum into 
two very distinct groups. Within these groups 
there are certain subsidiary patterns which will 
also be noted. 

Group one.—The profiles for this group are 
characterized by values of 2 or more in positions 
II and Ш and values of 1 or more (with a 
single exception) in position IV. Group one 
includes the fifteen subjects who have the lowest 
Rorschach maladjustment scores among the para- 
noids. These subjects tend to manifest rclatively 
moderate to extremely high rigidity trends. In 
almost all instances they show themselves to be 
guarded and unable to mobilize increased flexi- 
bility when faced with stress or emotional de- 
mands, Their rigidity patterns correspond to 
the concept of the "rigid paranoid." Marked 
disturbance in the organization and balance of 
the pattern is clear in almost every instance. 
This disturbance shows itself either in terms of, 
poor rigidity pattern balance scores or in the 
form of inappropriate peripheral flexibility. 

Certain interesting pattern variations appear 
at the two maladjustment extremes of the cases 
in group one. Thus, the three least sick indi- 
viduals in the group are associated with these 
patterns: 3 2 2 1,9 3 2 2, and 2 2 2 о. They 
are less completely rigid and less subject to pe- 
ripheral flexibility mobilizations. They do not 
retreat so strongly behind rigidity defenses when 
it is necessary to deal with a stress situation. At 
the other more maladjusted extreme of group 
one are also found individuals whose rigidity 
patterns tend to be less guarded and constricted 
then the middle segment of the cluster; but 
they, in contrast, show significantly more dis- 
turbance in balance scores and more peripheral 
irregularity than the least sick paranoids. In the 
One instance decreased rigidity implies less pa- 
thology and in the other it seems to imply more 
Pathology, 

Group two.—This group consists of the five 
Paranoids who show the highest degree of malad- 
justment and looseness on the Rorschach test. 

he characteristic patterns have in common 
Values of 1 in position III and (with one excep- 
tion) values of о in position IV. These paranoids 
display much less rigidity than one finds as- 
sociated with the other paranoids. Several do 
Manifest rather high rigidity trends when deal- 
ing with situations that have only peripheral 
importance to them, but they make little attempt 
to guard themselves in stress situations by using 


TABLE 2r 


RIGIDITY PATTERNS OF PARANOID SCHIZOPHRENICS 
RANKED ACCORDING TO RorsHACH MALADJUST- 
MENT SCORES AND SEPARATED 
INTO “CLUSTER” GROUPS 


Group I Group II 
Subject 55 Subject 59 
3 2 2 I ê GIT 
Subject 53 Subject 42 
4 $ # š 1 
Subject 43 Subject 48 
а 2 26 KE RE 
Subject 6o Subject 47 

2 3 2 ТТ 
Subject 44 Subject 46 
Do 3 3 XN 


de 
Subject 57 
4 13 3 ж 
Subject 50 
Su 2 "57508 
Subject 56 
I 44 44 
Subject 41 
3 э Заа 
Subject 49 
2944 wu Y 
Subject 54 
k dod d 
Subject 52 
2 4 3 X 
Subject 45 
a9 153 
Subject 58 
5.2 3 9 
Subject 51 
a2 2 3 


rigidity barriers. Indeed, relative to materials 
with emotional significance their flexibility is 
so extreme as to warrant the use of the term 
“Joose.” In contrast to the group one paranoids 
who, it may be postulated, are still seeking to 
defend themselves against their chief problems 
by a constriction process, these group two para- 
noids have tended to drop their defenses and 
seem to have given up trying to handle their 
problems actively. Of course, this last supposi- 
tion applies in different degrees. Thus, two of 
the cases (subjects 42 and 47), do show an almost 
complete absence of rigidity at any level and 
furthermore manifest few signs of disturbance 
about the loose unrealistic level of adjustment 
to which they have regressed. But three subjects 
(59, 48, and 46) are able to maintain considerable 
peripheral rigidity when not subjected to stress 
demands; and furthermore their balance scores 
imply considerable disturbance. 

Analysis of the STDCR scores in the two 
paranoid rigidity pattern groups indicates that 
with but a few exceptions the individuals in 
both of these groups tend to give scores higher 
(“better”) than the general population means. 
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TABLE 22 


INDIVIDUAL STDCR Scores, RANKS or STDER Torar Score: 


AND RANKS OF STDCR VARIATION 


SCORES FOR THE PARANOID SCHIZOPHRENIC SUBJECTS iN EACH PtiGIprry Group 


Rank of Sum Кайсы 
j D G Ra of STDCR ariation 
св е a Scores? Scores? 
Rigidity Pattern 
Group One 
Subject 55 3 9 8 то 2 1-5 2:9 
Š 53 4 3 3 3 4 15.0 19.5 
Š X3 6 4 6 6 4 13.5 14.0 
S бо 6 7 8 7 š б.о 12.0 
5 5 6 4 3 13.5 4.0 
S p 8 9 8 8 5 2.9 6.0 
S so 5 8 9 9 6 3.0 б.о 
S 56 5 10 8 8 o 9.0 1.0 
S a 2 5 3 3 I 19.5 9.o 
S uo 9 10 10 9 6 1.0 б.о 
S s 4 5 5 5 4 15.0 18.0 
S 52 5 9 7 6 3 10.0 3.0 
S 45 4 2 3 3 2 19.$ 15.5 
S 58 7 7 9 7 5 5.0 9.0 
S s 5 8 7 6 6 7.5 13.0 
Rigidity Pattern 
roup Two 
-Ó 59 5 6 6 5 5 12.0 19.5 
S 42 7 8 8 60 7 4.0 15:5 
S 48 4 7 3 4 4 16.0 11:0 
S 4 5 3 3 3 4 17.0 17.0 
S 4 6 4 8 7 4 II.O 9.0 


* Each factor is represented by a C-score based on adults. 


b Highest rank for highest sum. : 
* Highest rank for greatest variation. 


That is, most of the paranoids give an at least 
moderately "good" descriptions of themselves, 
When applied to group two this fact might 
superficially seem to contradict what has been 
indicated about the looseness of defenses in that 
group; but if one assumes that a "good" self 
description represents only a peripheral kind 
of defensive effort, the apparent contradiction 
disappears, A comparison of groups one and two 
entirely in terms of the paranoid STDCR scores 
alone reveals that: 


а. Nine of the fifteen subjects in group one 
have total STDCR scores which equal or exceed 
the middle rank position of {Не paranoid group; 
whereas 

b. Four of the five subjects in group two 
have total STDCR scores below the middle rank 
position of the paranoid group. 

€. Ten of the fifteen subjects in group one 
exceed the middle rank STDCR variation score 
of the paranoid Broup; whereas 

d. Four of the five subjects in 
below the middle rank STDCR 
9f the paranoid group. 


Apparently, the more rigid guarded paranoids 


group two fall 
Variation score 


not only tend to exceed the general population 
in the favorableness of their self descriptions but 


also to exceed the less rigid paranoids in this 
respect. However, despi 


ugh re s 'cen the amount of dis. 
criminative variation shown in answering STDCR 
questions and degree of maladjustment, 


4. Summary of Section 


In general terms, the following has 
been shown in this section: 


4. Each subject’s ri 
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TABLE 23 


SUMMARY TABLE SuowixG FOR THE THREE 
GROUPS THE DISTRIBUTIONS OF RIGIDITY PAT- 
TERN PROFILES, RANKED ACCORDING TO CORRE- 
SPONDING RORSCHACH MALADJUSTMENT SCORES 


Paranoid 


Rank Normal Hysteric 


3314 
IIIO 
3310 
IIIO 
3310 
3223 
3235 


оом Сил + Oo DH 


1322 
2220 
то 3122 
ІІ 2431 
12 3223 
13 2432 
14 2421 
15 3232 
16 3332 1232 
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18 1112 
I9 3223 
20 3332 

21 3233; 2331 a 
22 2332 


24 3232; 2421 
26 2431 

27 2220 
28 3433 3232 


30 1230; 3231 


33 3212 
34 aor? 


35 2430 
36 2331 

3221 
38 3225 
39 3112 


41 1112;3112 


44 2221; 2223 
46 1113; 2112 
48 IIII; 2311 


5o 2112 1430 
51 IIIO 


53 1420 
54 1220 
55 2330 
56 1220 
57 тїт 


59 I2II 
бо IIIO 


scores, clinical symptoms, and STDCR 
scores. 

b. Four rigidity patterns may be dis- 
cerned in the normal group. These are 
listed in the order of their degree of 
maladjustment (from low to high): (1) 
A low rigidity group marked by a high 
degree of balance in level relations. (2) 
A more constricted group whose rigidity 
pattern is however characterized by 
balance and unified coordination. (3) A 
group showing few generalized rigidity 
trends, but characterized by moderate 
disturbance in rigidity balance, and 
tendencies toward inappropriate mobili- 
zations of peripheral flexibility. (4) A 
group marked by extremely high dis- 
turbance in rigidity balance. 

c. 'Three rigidity patterns may be Че- 
tected in the hysteric group. These 
patterns are meaningful in terms of 
Rorschach maladjustment scores and 
also clinical symptoms. (1) А very con- 
stricted pattern group with only moder- 
ate rigidity balance disturbance. Clini- 
cal symptoms in this group are limited 
to localized pain. (2) A constricted group 
with an extremely high degree of rigidity 
balance disturbance and much unusual 
peripheral flexibility. Clinical symptoms 
are of a more general character. (g) A 
relatively less rigid, looser group which 
also manifests much disturbance in ri- 
gidity balance. Clinical symptoms are 
very extensive and serious (e.g. pro- 
longed convulsive seizures). 

d. Two rigidity patterns appear in the 
paranoid group. These are listed in their 
order of Rorschach maladjustment (low 
to high): (1) A highly constricted rigid 
group manifesting much disturbance in 
rigidity balance and a tendency toward 
inappropriate peripheral flexibility. (2) 
A loose group chiefly characterized by its 
low degree of defensive guarding. 

e. The self description given by a sub- 
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ject on the Guilford STDCR test has a 
functional relationship to the pattern of 
his rigidity defenses. However, it is a 
relationship that varies with the clinical 
group in which it occurs. Thus, normal 
subjects whose rigidity defenses are in a 
State of disturbance and who manifest 
unusual peripheral flexibilities give a 
very favorable self description. In both 
the hysteric and paranoid groups the 
most favorable self description is given 
by those inidivaduals who are the best 
organized and whose rigidity defenses 
are in a state of tight constrictive guard- 
ing. Least favorable self descriptions are 
given by those whose rigidity is dimin- 
ished and whose rigidity patterns tend 
toward looseness and "breakdown." 


F. DIAGNOSTIC DIFFERENTIATION 


The previous analysis has indicated 
that rather distinct sub-groups can be 
identified within each of the principal 
experimental groups. The question 
arises as to the possibility of using ri- 
gidity scores as an independent means of 
diagnosis—i.e., as a means of distinguish- 
ing normals from persons identified psy- 
chiatrically as sick and also of distin- 
guishing persons in different psychiatric 
categories, Specifically, can rigidity 
scores be used to differentiate the nor- 
mals from the hysterics and paranoids? 
Furthermore, can the hysterics be differ- 
entiated from the paranoids? 

The rigidity balance score does not 


‚ successfully give such differentiation. 
Even in terms of mean differences the 
normals are only moderately well sepa- 
rated from the other groups (although 
these differences are statistically signifi- 
cant) and the hysterics are not signifi- 
cantly differentiated from the paranoids. 

In an attempt to work out a technique 
that would be more successful for diag- 
nostic differentiation, several scoring 
systems were attempted. One such scor- 
ing technique based upon a series of 
penalty weights for all instances of un- 
usually high or low peripheral rigidity, 
all instances of unusually low ability to 
mobilize flexibility for meeting stress 
situations, and all instances of unusual 
differences in rigidity for tasks at the 
same rigidity level proved to be moder- 
ately differentiating. It differentiated the 
normal subjects from the hysterics and 
the paranoids excellently. However, it 
failed to distinguish the hysterics from 
the paranoids successfully. That is, it was 
possible to separate those who seem to be 
making a normal adjustment from sub- 
jects who are clinically diagnosed as 
abnormal. However, since the primary 
purpose of this study is an analysis of 
rigidity phenomena and only very sec- 
ondarily a consideration of diagnostic 
differentiation, the rather complicated 
weighting system evolved for the above is 
not described. 

In general, the ri 


gidity battery seems 
to h 


ave greatest potential usefulness in 
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cases where other material (e.g., clinical 
behavior or psychological test results) 
identify an individual's illness rather 
definitely as being neurotic in contrast to 
psychotic—or vise versa. In such cases (as 
Shown in the section of analysis of indi- 
vidual patterns) the rigidity scores are 
valuable for perceiving the kind of de- 
fenses an individual is relying on; how 
disturbed or unstable these defenses are; 


and how near they lie to the point of 
extremely serious disorganization, Thus, 
for example, one would have a better 
understanding of the seriousness and 
prognosis of a case of conversion hys- 
teria if it were known whether the ri- 
gidity pattern was dominated by tight 
constriction, or unusual degree of periph- 
eral flexibility, or perhaps tendencies 
toward looseness. 


CHAPTER V š 
GENERAL DISCUSSION OF DATÁ 


S IN many other studies which at- 

tempt to get at some aspect of per- 
sonality, the problem of specificity-gener- 
ality has arisen prominently. That is, 
the question has been raised whether the 
rigidity phenomena here observed can 
possibly be explained either in terms of a 
"general rigidity" concept or in terms 
of specific, rather unrelated reactions to 
particular stimuli. The character of the 
results obtained imply that personality 
rigidity manifestations cannot accurately 
be described either in very specific terms 
or in very general terms. Neither single 
rigidity task scores nor gross summations 
of many scores have proved to be mean- 


ingful explanatory. In contradiction to ' 


such simple explanatory approaches, one 
is struck by the degree of complexity in 
the interrelations of the various rigidity 
measures. It is possible to note simul- 
taneous correlated, contradictory, and 
complementary rigidity trends in the 
same subject. Indeed, aspects of the overt 
rigidity behavior of an individual may 
imply just the opposite of what is true 
with regard to that individual's basic 
rigidity pattern. For example, essentially 
rigid conversion hysterics or disturbed 
normals who are becoming increasingly 
rigid often display flashes of unusual 
flexibility when dealing with certain 
kinds of situations. 

The data here obtained suggest that 
there is a reasonable basis for assuming 
that there are at least two roughly de- 
fined rigidity levels in any given per- 
sonality structure—an ego level and a 
peripheral level. 'That is, if one observes 
the range of rigidity shown by individ- 
uals in meeting different kinds of situa- 
tions, it becomes rather apparent that 


one can at least differentiate the degrec 
of rigidity shown to situations that are 
not emotionally threatening from the 
degree of rigidity shown to those situa- 
tions that do threaten the individual or 
somehow do call into question his self- 
esteem. The type of rigidity shown in 
reacting to the first kind of situation may 
be designated as peripheral or outer ri- 
gidity; whereas the second type of reaction 
may be called ego or "inner" rigidity. 
If one postulates that the character of 
peripheral (outer) rigidity represents 
how much need the individual has to 
guard himself even in non-threatening 
situations and further assumes that ego 
(inner) rigidity represents the degree of 
defensive guarding an individual mobil- 
izes when faced with emotional demands 
or threats, it may then be hypothesized 
that the pattern of relationships between 
these two levels gives us at least a vague 
picture of the manner in which a given 
individual's personality defenses are or- 
ganized. 

In these terms the defenses of a nor- 
mal undisturbed person are marked by a 
series of balanced relationships between 
the inner level and the outer level. Their 
peripheral defenses usually manifest a 
certain basic minimum of rigidity; and 
the ego defenses show a consistent ability 
to mobilize a greater degree of flexibility 
than is habitually shown by the periph- 
eral defenses. This relative relation is 
very important. There are several in- 
stances in which undisturbed normals 
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demonstrate considerable rigidity at both 
levels; but these individuals are consis- 
tently able to call up a degree of flex- 
ibility for meeting stress situations that 
exceeds their customary outer rigidity. 
If such a normal person encounters prob- 
lems or stresses that he cannot handle 
adequately with his current rigidity de- 
fenses, interesting adjustive changes be- 
gin to show up in his rigidity behavior. 
Disturbances in the peripheral defenses 
begin to appear. The balance of moder- 
ate to moderately high rigidity that pre- 
viously characterized this level gives 
Way to erratic fluctuations that vary 
from high rigidity to extremely high 
flexibility, Many situations that were 
previously adjusted to in terms of cus- 
tomary outer defenses can no longer be 


handled that way. The individual seems „ 


to be uncertain as to just how much he 
can count on his outer defenses and con- 
sequently many formerly “peripherally 
handled" situations become involved 
with ego defenses—with resultant erratic 
Variations in the kind of rigidity shown 
at the peripheral level. As the individ- 
ual's disturbance becomes even greater, 
his ego defenses become more seriously 
affected, They shift sharply in the direc- 
Hon of greater rigidity. They tighten 
and the ability to mobilize increased 
flexibility for meeting emergencies dim- 
Inishes significantly. 

_ Passing from the normal to the conver- 
Sion hysteric, one finds a rigidity defense 
Pattern which is only a more extreme 

orm of that seen in the very disturbed 
Es тта], However, in these hysteric cases 
rigidity at both defense levels is usually 
E greater, and the outer defenses are 
ma. s disturbed. But most out- 
ifs. s of all is the fact of extremely 
that Ka ego-level constriction; the fact 

ress demands seem to call forth 


only evasive retreat behind rigidity bar- 
riers. It is interesting to note that some 
few of the hysterics are characterized 
by a very rigid defense pattern and yet 
show only moderate peripheral distur- 
bance and few signs of pattern instabil- 
ity. These are perhaps individuals who 
have worked out a neurotically con- 
stricted defense system capable of bear- 
ing with some efficiency the pressure of 
their current difficulties. In the most 
radically disturbed conversion hysterics 
one perceives a sharp shift in the pattern 
of rigidity defenses. An uncoordinated 
loosening of both the outer and inner 
defense levels appears. Disturbance per- 
sists in the form of erratic peripheral 
fluctuations and lack of balanced rela- 
tionship between ego and peripheral bar- 
riers; and the individual no longer 
makes much attempt to deal with situa- 
tions by retreating behind his rigidity 
defenses. At this stage of disturbance 
defenses dominantly based on retreat to 
rigidity are apparently no longer suffi- 
cient. Transition to a looser more re- 
gressive type of defense system seems to 
be in progress. This latter assumption is 
confirmed by the fact that regressive 
symptoms (e.g. generalized. convulsive 
seizures) begin to appear in such cases. 

Those paranoid schizophrenics who 
retain considerable control in their be- 
havior and who are still strongly motiv- 
ated to conceal and rationalize their 
psychotic thinking seem to make much 
use of rigidity in both their outer and 
inner defenses. They have a set to deal 
with situations in terms of minimum 
involvement. In general, however, their 
defenses tend not to be so completely 
closed and rigid as is the case with the 
middle group of conversion hysterics. 
There are more instances in which the 
ego defenses of these paranoids are not 
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completely rigid barriers. But since their 
defenses are almost always in a state of 
high disturbance one can only interpret 
the lesser rigidity as a sign of greater 
vulnerability to disturbing stimuli. Para- 
noids who clinically (and in terms of 
psychological measures) seem to be sick- 
est and most disorganized no longer 
make much genuine use of rigidity bar- 
riers. Although many of them have outer 
defenses which are very rigid, they do 
not seem to have any real inner defenses. 
This is demonstrated by the fact that 
demands which manage to evade their 
outer defenses or which are too stressful 
to be blocked peripherally, call forth 
very loose, obviously psychotic, reactions. 
Such reactions can rarely be called forth 
in the paranoid whose rigidity pattern 
indicates much rigidity at the ego level. 

Altogether, this preliminary attempt 
to sketch in semi-precise terms the kinds 
of personality defense changes that ac- 
company different degrees of personality 
maladjustment seems to give a meaning- 
ful consistent picture. 

One very interesting problem that the 
above analysis of diagnostic groups 
brings to the fore has to do with the 
meaningfulness of our present diagnos- 
tic categories. Even if one grants that 
certain of the experimental subjects were 
incorrectly diagnosed, the fact remains 
that each of the experimental diagnos- 
tic groups includes many persons whose 
personality defenses seem to be organized 
in significantly different patterns. For 
example, among the conversion hysterics 
there are defense patterns marked by 
extreme rigidity at all levels and con- 
trastingly there are patterns which are 
loose at all levels. The fact that these 
rigidity pattern differences are also fre- 
quently accompanied by radical differ. 
ences in Symptomatology leads one to 


think critically about the rationale of 
present classification schemes. At present, 
of course, psychiatric diagnosis is pri- 
marily a matter of evaluating symptoma- 
tology; but one may ask how valid such 
a procedure is if it leads to the grouping 
together of individuals whose personality 
defenses are apparently quite different. 
If future work with other fairly precise 
personality measures should confirm the 
kind of data here obtained about diag- 
nostic groupings, serious criticisms 
might be raised against these groupings. 
It is obvious that important errors in 
treatment and understanding may result 
from assuming that a conversion hysteric 
“type A” is equivalent to a conversion 
hysteric “type B,” or that a paranoid 
schizophrenic "type A" is equivalent to 
a paranoid schizophrenic "type B." The 
actual isolation of such types in a very 
exact sense could probably be achieved 
if a body of data were systematically ac- 
cumulated about a range of personality 
defense phenomena, as has been done 
here relative to rigidity phenomena. The 
specific diagnostic value of the present 
approach to rigidity seems to lie in the 
fact that it can give valuable information 
relative to variations within diagnostic 
categories. Although it is true that when 
used alone the rigidity battery is at 
present able to distinguish only normals 
from abnormals, it is capable of giving 
very interesting insights into individual 
cases when used in conjunction with 
other materials (e.g., the Rorschach test). 
It may be added that the rigidity battery 
seems to have a high degree of sensitivity 
for detecting disturbance in normals. If 
further work with normals confirms this 
point, it is possible that an abbreviated 
form of the battery would prove of value 
10 certain types of selection work, 

Many of the points that have emerged 
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from the data have already been implicit 
or explicit in previous work. Thus, the 
whole viewpoint that has been expressed 
here with regard to levels of rigidity and 
the distinction between outer and inner 
rigidity has run through the thinking 
of psychologists and psychiatrists for 
several decades. It has been particularly 
marked in the thinking of the Lewin- 
lans, whose claborate personality-struc- 
ture scheme is prevaded with concepts 
like “sub-wholes of the personality 
Structure," "ego barriers," and “periph- 
eral barriers." Certain of the very speci- 
fic findings of the present study (e.g. 
With regard to the rigidity of the con- 
Version  hysteric and the paranoid 
schizophrenic) have also been anticipated 
in previous work (Angyal, Shakow, and 
Rosenzweig). Actually, the value of the 
material presented in this paper lies not 
$0 much in its specific novelty, but rather 
in the fact that it represents at least an 
milial integration and confirmation of 
past isolated guesses and findings about 
rigidity behavior. Furthermore, it has 
value insofar as it has rationalized a par- 
ticular set of measuring procedures that 
are based on a systematic point of view. 
Vague speculations about 80 and 
peripheral defenses or changes that occur 
in these defenses under differing condi- 
tions are now subject to some kind o 
quantitative checks, even though they be 
Only crude approximate checks. 

In view of the importance that has been 
Placed on the concepts of “ego” and “peripheral” 
TiBidity in this study, it is pertinent that the 
conclusions reached here concerning them be 
Telated to previous work in the same area. The 
most extensive past experimental consideration 
of these concepts occurs in the series of “task- 
Interruption" studies done by Rickers-Ovsiankina 
Ga) and Bennett (5). Rickers-Ovsiankina found 
hat schizophrenic subjects tended not to build 


"D “completion tensions” relative to neutral 


tasks—as exemplified by the fact that they gen- 


erally failed to resume such tasks following the 


removal of interposed interruptions. From such 
data she concluded that the outer peripheral 
boundaries of the schizophrenic are so fluid that 
tensions raised by neutral tasks are dissipated 
before they can form firmly segregated systems. 
Bennett then demonstrated that if schizophrenics 
are interrupted while performing tasks with real 
emotional significance to them, they manifest 
strong “completion tensions.” She reasoned that 
this must imply that emotional materials have 
an opportunity to excite segregated tension sys- 
tems because there are rigid barriers around the 
schizophrenic's ego arca that prevent dissipation 
of tensions. That is, she demonstrated a real 
difference in the rigidity of ego boundaries as 
contrasted. with peripheral boundaries. Further- 
more, she (5, p. 241) noted that “їп the central 
layers of the [normal] person, . . - communication 
[between systems of the central layers] seems to 
be even greater than that for peripheral systems," 
In a broad sense the results of the present study 
have been congruent with these interpretations 
based on task-interruption data. Thus, the ego- 
peripheral rigidity distinction has demonstrated 
itself clearly. Furthermore, it is particularly in- 
teresting that Bennett's conclusions regarding the 
olesser rigidity in normal subjects of ego areas 
as contrasted. with peripheral areas has also 
been confirmed herc. ` 
However, there is one important point on 
which the present study stands in disagreement 
with the conclusions of Rickers-Ovsiankina and 
Bennett. This is relative to the first investigator's 
acceptance of the idea that the peripheral 
boundaries of schizophrenics are in general fluid 
and the second investigator's statement that the 
ego boundaries of schizophrenics are in general 
rigid. In terms of the complex variations in 
ego and peripheral rigidity found here in para- 
noid schizophrenics the above conclusions are 
misleading simplifications. Even within the 
limited range of a paranoid schizophrenic group 
one finds many cases of high peripheral rigidity, 
low peripheral rigidity, and also cases of marked 
ego fluidity and ego rigidity. As one reviews the 
data of Rickers-Ovsiankina and Bennett, one 
feels that a similar complexity actually char- 
acterized their results. However, it would appear 
that in the process of emphasizing and sharpen- 
ing group comparisons they leaned toward ex- 
cessive simplification. Their failure to underscore 
the complexity of rigidity phenomena may pos- 
sibly also be attributed to the following: (1) The 
fact that their methods of measurement dealt 
with such a limited sampling of behavior. (2) 
The fact that they both assume that failure 
to develop completion tensions is due only to 
the absence of barriers necessary to prevent the 
dissipation of tensions. If one assumes that 
failure to develop completion tensions may be 


40 SEYMOUR FISHER 


due also to failure of tasks to excite motivation 
by penetrating rigidity barriers, the possibility 
of a more complex interpretation of their data 
arises. That is, one should at least accept the 
possibility that failures to develop: completion 
tensions may have alternative causes. On the one 
hand it may in some cases be due to high 
fluidity and in other cases it may be due to 
high rigidity. у 

Altogether, there seems to be justification for 
questioning the generalizations about peripheral 
and ego rigidity in schizophrenics that have been 
offered by Rickers-Ovsiankina and Bennett. 


While working with the present rigid- 
ity battery certain problems and diffi- 
culties have become apparent which sug- 
gest several future research possibilities 
on the rigidity question. 

1. First of all, it is clear that it would be 


desirable to work out a more balanced series of 
rigidity tasks—that is, a series in which ego-level 


tasks would not be so outnumbered by peripheral 
tasks, and in Which at least one other defense 
level is identified ^etween the peripheral and 
ego levels. Such refining of the rigidity battery 
would perhaps lead to more precise generaliza- 
tions about rigidity relations, and possibly permit 
better discrimination of clinical groups. 

2. Since this study was limited to women of 
three specified kinds of groups and of a particu- 
lar range of intelligence, it is of course necessary 
to determine whether the results obtained apply 
to a wider range of subjects and how they are 
affected by extremes of intelligence. 

3. Another interesting possibility would be a 
study of changes that occur in rigidity patterns 
when attempts are made to produce shifts in 
adjustment during different kinds of therapy. 
Thus, one might measure changes following 
electric shock. This approach might better our 
understanding of what kinds of defense reorgan- 
izations occur during such procedures. It would 
also be of special interest to work out the possible 
prognostic value of the rigidity measures. 
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CHAPTER VI 
GENERAL SUMMARY OF FINDINGS 


HE findings described below are 

based on a study of sixty women of 
average intelligence—twenty normal sub- 
Jects, twenty conversion hysterics, and 
twenty paranoid schizophrenics. The 
chief points that emerge most clearly 
from the study may be summarized as 
follows: 

1. No single rigidity measure based on 

any one task is successful either as a 
measure of “general” personality rigidity 
trends or of personality characteristics 
that might be thought of as related to 
rigidity, 
Ш>: Analysis of the relationships obtain- 
ing between total rigidity scores (a sum- 
mation of many individual rigidity 
Scores) and various other indices (e.g.. 
Rorschach rigidity scores and clinical 
classification) of personality rigidity indi- 
cate that the total rigidity score is a mod- 
€rately good measure of what might be 
called “general” rigidity trends. How- 
ever, it has also been shown that the 
Concept of general rigidity as a real en- 
tity is to a considerable degree fictitious 
and that one should probably not antici- 
pate more than very moderate success in 
attempting to measure such a variable. 

3. It has been shown that in general 
conversion hysterics and paranoid schizo- 
Phrenics are more rigid than normal sub- 
Jects. However, there are no clear differ- 
ences between the conversion hysterics 
and paranoids. 

4. It appears to be fruitful to conceive 
of rigidity behavior as correlated with a 
Series of "defense levels" in the personal- 
ity structure, The degree and character 
of the rigidity shown by an individual 
In dealing with a given situation seems 
to vary in proportion as that situation 
Poses a.threat or raises serious adjust- 


ment demands. In the normal personal- 
ity certain optimum "ratios" are found 
to exist if one compares the amount of 
rigidity shown when dealing with non- 
threatening (peripheral) tasks as opposed 
to the amount of rigidity shown for 
threatening (ego involving) tasks. If the 
ratio or relation of ego level rigidity to 
peripheral level rigidity is for each indi- 
vidual expressed as a quantitative score 
indicating degree of departure from an 
ideal optimum ratio, correlations of such 
scores with certain measures of personality 
defense disturbance (e.g., the Rorschach 
and clinical symptoms) are significantly 
high. 1 
5. Intelligence, in the formal psycho- 
cmetric sense, seems to have no clear re- 
lationship to the character of an 
individual's rigidity pattern—i.e., the 
relations obtaining between the rigidity 
of the peripheral defenses and ego de- 
fenses. 

6. It is clear that personality rigidity 
may manifest itself in diverse, often un- 
recognized ways. Indeed, internal pcrson- 
ality rigidity may show itself externally 
in a form that appears to be just the 
opposite of rigidity and even scems to 
imply extreme flexibility. Thus, numer- 
ous examples have been found in the 
data of subjects who are fearful and 
highly rigid in adjusting to ego involved 
situations; but who mobilize an unusual 
amount of unnecessary flexibility in ad- 
justing to peripheral unimportant tasks. 
This non-functional апа superfluous 
peripheral flexibility seems to represent 
a false "emergency reaction," a compen- 
satory effort stimulated by the insecurity 
associated with an overly rigid ego de- 
fense system. 

7. A “rigidity profile’ may be con- 


41 


42 SEYMOUR FISHER 


structed for each subject. This profile 
defines the nature of an individual's 
rigidity behavior in terms of four vari- 
ables: 

(a) The degree to which the individ- 
ual's ratio of peripheral to ego rigidity 
departs from an ideal "normal ratio." 

(b) 'The degree to which the individ- 
ual's reactions seem to reflect a general- 
ized rigidity that cuts across a wide 
range of response. 

(c) The degree of rigidity shown when 
dealing with situations of a threatening 
or ego significance. 

(d) The degree to which the individual 
seems to mobilize unusually great or 
unnecessary flexibility in meeting situa- 
tions not usually conceived as ego in- 
volving. 

It has been shown that within each 
of the experimental groups the "rigidity 
profiles" of subjects fall very logically 
into clusters. Those clustering together 
have approximately the same Rorschach 
maladjustment scores and display very 
marked similarities in clinical symptoma- 
tology. i 

8. The variety of "rigidity profiles" 
falling within the same diagnostic 
groups raises some doubt as to the ra- 
tionale of present diagnostic classifica- 
tion schemes. Such questioning would 
have a more valid basis if confirmed by 
studies of other personality characteris- 

tics making use of at least moderately 
precise measures. 

9. There are indications that the de- 
scription which a subject gives of himself 
on the Guilford STDCR is functionally 
related to the kind of "rigidity profile" 
manifested by him. That is, those sub- 
Jects whose "rigidity profiles" fall into 
the same clusters because of similarities 
in pattern tend also to give similar re- 
sponses to questions about themselves on 


the STDCR. For example, alí those nor- 
mal subjects whose "rigidity profiles" arc 
marked by unusual deviations from the 
ideal "normal ratio" of ego to peripheral 
rigidity and also by frequent unnecessary 
mobilizations of flexibility in the face 
of non-threatening tasks give very favor- 
able descriptions of themselves. They 
may be contrasted with those normal 
subjects whose "rigidity profiles" show 
no such disturbance and who correla- 
tively do not give extremely favorable or 
"good" STDCR descriptions of them- 
selves. In general, the indications are 
that it is not possible to detect the 
clusters of subjects with similar STDCR. 
Scores or to explain the logic of such 
clusters without using the "rigidity pro- 
file" as a frame of reference. 

10. It is possible in a statistically sig- 
nificant manner to differentiate the nor- 
mal subjects from the abnormal subjects 
by use of various single rigidity indices 
(e.g., the total rigidity score or the ratio 
of peripheral to €go rigidity). However, 
this kind of differentiation is only on a 
rough group basis and not of sufficient 
clarity to permit the use of such measures 
as individual diagnostic tools in the con- 
ventional sense of the present diagnostic 
System of classification. The greatest 
“diagnostic” contribution of the rigidity 
battery seems to be toward detailed dif- 
ferentiation within clinical groups. Thus, 
by use of the “rigidity profile’ it has 
been shown that one can with consider- 
able accuracy discriminate disturbed 
normal subjects from well adjusted nor- 
mal subjects (as defined by the Ror- 
schach), conversion hysterics with limited 
localized Symptoms from hysterics with 
more serious generalized symptoms, and 
Well protected paranoid schizophrenics 


from more seriously disorganized para- 
noid schizophrenics, 
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VIGOTSKY SCORING 


To score the Vigotsky sortings the following 

weighting system was used. 
, 5=A basic variation. This included all sort- 
ings in which the total range of blocks was 
divided into groups on the basis of size or shape 
or color or number of sides. 

3 = For any variation in which the subject 
attempts to put one of each “kind” in each of 
the sorted groups. Thus, relative to form, this 
Would be illustrated by a series of groups each 
Containing a square, circular, and hexagonal 
block. Relative to color, it would be illustrated 

Y а series of groups cach containing five repre- 


sentative colors. 

— A minor variation. Any repetition of a 
previous basic sorting—but with at least some 
minor change in the previous distribution of 
the blocks. 

1 — Concrete variation. Any use of the blocks 
to make concrete objects or to build designs in 
jig saw fashion. 

The above weighting system is based on clini- 
cal experience with the Vigotsky and is intended 
to assign each sorting to a level corresponding 
roughly to the complexity of analysis or abstrac- 
tion involved. 


APPENDIX II 
TAT SCORING 


The following weighting system was used to 
Score TAT interpretations. 

2=A basic! interpretation which involves all 
of the pictured persons and revolves about а 
Plot in which the characters do something or 
are acted upon, 

,*- This weight also given for an interpreta- 
Чоп which employs the same characters as а 
Preceding basic interpretation—but which re- 


volves about a plot in which the characters do 
something different or are acted upon differently. 

1 — Given for an interpretation with different 
role takers than a preceding interpretation—but 

(with the same plot or kind of action. 

1 = Given for any basic interpretation which 
ignores any outstanding figure on the card. 

o = For mere card description or rejection of 
a card. 


APPENDIX III 
ASPIRATION SCORING 


The following weighting system was used to 


evaluate aspiration changes that occurred follow- 
ing arbitrarily reported achievement scores. The 


table below is the basis for deriving three aspira- 
tion rigidity scores, as described in Chapter III, 


section B, 10. 
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In a few cases where the character of the 
subject's expressed aspirations made it impossible 
to obtain four trials in the "success" series, the 


three obtained trials were equated to four by 
adding the average of the major trend (with 
fractions disregarded). 


APPENDIX IV 
SCORING FOR RORSCHACH RIGIDITY 


Each of the weights below is a penalty for 
what is considered to be excess rigidity or re- 
strictiveness. The larger the final summation of 
weights, the greater is the implied rigidity. 


Е per cent (Foy, 
(1) If the number of responses is 22 or fewer— 
(a) An F% of 55-60 gives a score of 3. 
(b) An F% of 61-70 gives a score of 4- 
(c) ^n F% of 71-80 gives a score of 6. 
(d) An F% of 81-90 gives a score of 8. 
(e) An F% of 91-100 gives a score of 11. 
(2) If the number of responses is over 22— 
(a) An F% of 55-60 gives a score of 4. 
(b) An F% of 61-70 gives a score of 5. 
(c) An F% of 71-80 gives a score of T. 
(d) An F% of 81-90 gives a score of g. 
(е) An F% of gt-100 gives a score of 12. 


Nunmber of Responses (R) 
(1) If the number of responses lies between 
o and 15, inclusive, the score is 9. 
(2) If the number of responses lies between 
16 and 20, inclusive, the score is 6. 


Percentage of Animal Responses (A) 
(1) If the percentage of "animal" responses is 
60-70, the score is 6. 
(2) If the percentage of "animal" responses is 
71-80, the score is 8. 


(3) If the percentage of "animal" responses is 
81-90, the score is 10. 


Form Accuracy (F+) 
(1) If the number of responses lies between 
15 and 22, inclusive— 


(a) And if the F% is at least 40 and not 
higher than 5o— 


An Е--%, of 85-90 gives a score of 5. 
An FL% of 91-100 gives a score of 8, 
(b) Or if the F% is 51 or higher— 

An F+% of 85-90 gives a score of 6. 
An Е--%, of 91-100 gives a score of 9. 


If a record contains fewer than fi 


fteen respons 
only one-half of any given weight B 


applies. 


„1 For three of the cards @, IIT, and 
given basic variation or any other variation could 
earn only one credit, This Was done because of 
the ease with which most Subjects worked out 
interpretations for these Cards that concealed 
rigidity tendencies on more “difficult” cards, 


УШ) a 


(2) If the number of responses exceeds 22— 

(a) And if the F% is at 35 and not higher 
than 5o— 

An F+% of 85-9o gives a score of 6. 

An F+% of 91-100 gives a score of 10. 

(b) Or if the F% is 51 or higher— 

An F+% of 85-90 gives a score of 6. 

An Е--%, of 91-100 gives a score of 12. 


Card Turning 
Less than two responses in which the card is 


viewed other than in the upright position gives 
a score of 4. 


Color 


Less than two FC responses is scored 6, (M.FC 
1s not counted as an FC) 


Whole Responses (W) 

(1) If the percentage of W responses is 40-50— 
(а) The score is 5 in records where the num- 
ber of responses is 2» or under. 

(b) The score is 6 in records where the num- 
ber of responses exceed 23. 

(2) If the W% is 51-60— 

(а) The score is 7 where the number of re- 
Sponses is 29 or less, 

(b) The score is 9 where the number of re- 
sponses is over ээ, 

(3) If the W% is 61 or over— 

(a) The score is io where the number of re- 
Sponses is 22 or less. з 

(b) The score is 19 Where the number of re- 
Sponses is over 22, ч 

Small Detail Responses (Dd) 

(1) If the ра, is 18-23— 

(a) The score is 3 where the number of re- 
sponses is under 22. 

(b) The score is 4 where 
sponses is over 92, 

(2) If the ра is 24-30— 

(а) Тһе Score is 5 where the number of re- 
Sponses is 22 or under, 

(b) The score is 6 where 
sponses is over ээ, 

(3) If the Dd% is 31-40— 

(а) The score is 8 where 
Sponses is 22 or under, 


(b) The Score is g where the number of re- 
sponses is over 22, 


(4) If the Dd% is 41 or over— 


the number of re- 


the number of re- 


the number of re- 
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(а) The score is 10 where the number of re- 
sponses is 22 or under. € 

(b) The score is 11 where the number of re- 
sponses is over 22. 


Movement Responses (M) 
(1) IE number of movement responses is less 
than two the score is 15. 
(2) If the number of movement responses is less 
than one the score is 20. 


Content 


If four or more responses fall into the same 
Content category (aside from "human" and 
animal" responses)— 


(а) And if the number of responses is 1 
through 25 the score is 7. 

(b) And if the number of responses is 26 
through 4o the score is 6. 

(c) And if the number of responses is 41 or 
over, the score is 5. 

Average Time Of Initial Response (T /r R) 

If average reaction time per initial response is 

(a) 25 through 29 seconds, the score is 2. 

(b) 30 through 4o seconds, the score is 5. 

(c) Over 40 seconds, the score is 7. 

Shading Responses (FY) 
Less than two FY responses gives a score of 3. 


APPENDIX V 
SCORING FOR RORSCHACH MALADJUSTMENT 


Each weight given below is a penalty for 
What is considered to be an indicator of person- 
ality maladjustment. 


Small Detail Responses (Dd) 
_ (1) If the Dd% is 18-25 inclusive, the score 
15 2, 
(2) If the ра, is over 25, the score is 8. 


Whole Responses (W) 
(1) If the W% is 30-35 inclusive— | 
‚ (а) The score is 6 if the number of responses 
18 16-95 
25. 
i (b) The score is 7 if the number of responses 
5 26 or higher. 
(2) If the W% is 36-40 inclusive 
о 15 3 d 
‚ (а) The score is 8 if the number of responses 
is 16-25, 
is 0 "The score is 10 if the number of responses 
26 or higher. 
(3) If the W% is 41 or higher— $ 
i (а) The score is 12 if the number of responses 
Š 17 or over. 
is (b) The score is 8 if the number of responses 
14-16, 
is (с) The score is 6 if the number of responses 
11-13, 
M (d) The score is 4 if the number of responses 
9:10. 
ove penalty 
half of the 
un- 
-and 


only one-half of any of the ab 
Ы is given if more than опе- : 
jj ses in the record are not minus ОГ 
С). *rentiated type of color responses (viz, CF. 


Е 

i ne thermore, except within the range 41% ог 
iti CT, only one-half of a penalty weight ose 
than ig. umber of responses in the record is les 


(4) 1t the W% is less than 15, the score is 2: 


(5) If the W% is less than 10, the score is 3. 

(6) If one-fourth or more of the W's in a 
record lie beyond the middle of the response 
sequence of the particular cards on which they 
occur, the score is 3. 

(7) If the sequence is confused, the score is 5. 


Movement Responses (M) 

(1) If the number of M's is less than g— 

(a) The score is g if the number of responses 
is 18 or less. 

(b) The score is 5 if the number of responses 
is 19 or more. 

(2) If the number of M's is less than 2— 

(а) The score is 10 if the number of responses 
is 18 or less. 

(b) The score is 12 if the number of responses 
is 19 or more. 

(3) If the number of M's is less than 1— 

(а) The score is 14 if the number of responses 
is 18 or less. 

(b) The score is 16 if the number of responses 
is 19 or more. 

(4) I£ an M does not occur on card III, the 
score is 6. 

(5) One M of bad form quality (M minus) 
equals a score of 3. 

(6) Two M minus responses equal a score of 
10. 
(7) Four M minus responses equal a score of 
12 


(8) If 2-3 M’s occur in small details, there is 


a score of 4. 

(9) If 4 ог more M's occur in small details, the 
score is 7. 

(10) If there are over two M's in a record and 
if half or more are found beyond the middle 
of the cards on which they occured, the score 


is 5. 


46 SEYMOUR FISHER 


Loose Color or Color of Poor Quality 
(С, CF—, FC—) 

(1) A pure color response (C) without any 
form is equal to a score of— 

(a) 3 on card X. 

(b) 2 on cards II or III. 

(c) 2 on card VIII. 

(d) 1 on card IX. 

(2) A color response with some minimum form 
but of poor quality (CF—) and containing either 
blood content, fire content or anatomy content 
gives— 

(a) А score of 8 on cards II, ПІ, or IX. 

(b) A score of 4 on cards VIII or X. 

(3) Minus FC responses 

(à) One FC minus response equals a score 
of 2. 

(b) Two FC minus 
of 6. 

(c) Three or more FC minus responses equal 
à score of 8. 


responses equal a score 


Absence of Color 

(1) If color response involving a good integra- 
tion of form and color (FC+) 

(а) Is absent on cards II and III, the score 
is 6. 

(b) Is absent on cards УШ, IX, апа X, the 
Score is 9 (if FC given as last response on X, 
the score is 4). 

(2) No color in the total 
score of 4. 

(3) If there is only one FC in the total record 
and the rest of the color is C and CF minus, 
the score is 8. 


record gives the 


Color Naming (CN) 
(a) CN on one card equals a score of 12. 


(b) CN on two or more cards equals a score 
of 16. 


(1) If F is from 57 to 64 inclusive— 

(a) An R of 15 through 20 (if less than 15 R, 
at least 9 of Tesponses should be in 
category) will Bive a score of 5 for an F% 
55; and a score of 7 for an F% of 56 and о 

(b) An R over 20 will give 
4 for an F% of 85-45. 

6 for an F% of 46-55. 
8 for an F% of 56 and over, 

bi РЯ F+ from 50 through 56 

4) Ап R of 13 thr ill gi 
wie ae Fe of een 20 will give a score of 
о Of 41-50. 

0 Of 56 and over. 
10 for an F% ol ea TER Yd 
15 for an F% of 41-50. 

20 for an Fo of 56 and Over, 


(8) IE FL is under во. 


(a) An R of 10 through 13 will give a score of 
8 for an F% 0735-40. 
10 for an F% of 4 -5o. 
15 for an F% of 51 and over. 
(b) An R of 14 through 20 will give a score of 
12 for an F% of 35-40. 
18 for an F% of 41 50. 
25 for an F% of 51 and over. 
(c) An R over 20 will give a score of 
18 for an F% of 35-40. 
25 for an F% of 41-50. 
30 for an F% of 51 and over. 


Very Poor Form Quality 
If a response is F minus, is a severe departure 
in form from the card area used, and contains 
very peculiar or fantastic content (c.g., “people 
catching blood in a cup," "dirty uterus," "inside 
of a womb"), that response is scored 5. 


Number of Responses 
(1) If there are less than оо responses, the 
score is 2, 
(2) If there are less than 15 responses, the 
score is 5. 
(3) If there are less than 


12 responses, tha 
score is g. 


Average Time of Initial Responses (T/1R) 


(1) If the TAR is 25 through 55 seconds, the 
Score is 2, 


(2) If the TAR is 36 seconds or greater, the 
score is g. 
Rejection of Cards 
(1) Rejection of card I or IV or VI or IX 
equals a score of 7, 
(2) Rejection of card If or HI or V or УП 
or VIII or X equals a score of 9. 
. (8) Rejection of two cards in sequence results 
in an extra penalty of 5. (No more than two 
rejections, one for each of above categories, is 
counted), 
Shading Responses 


z (1) 1 to 2 pure shading responses (Y) is scored 


a 3 to 


(3) 5 or more pure Y responses is scored б. 


(4) 2 to 3 shading responses with little form 
(YF) is scored э, 


(5) 4 or more YF responses is scored 5. 
White Space Responses (S) 


(1) 4 055 responses is scored 9, 
(2) 6 or more § Tesponses is scored 4. 


Popular Responses (P) 
(IË less than 12 R in record do not score for 


P) 


(1) If the record has 12-15 R, 
(a) Less than IP responses is scored 2. 
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(b) Less than 2P responses is scored 4. 

(2) If record has 16-25 R, € 

(а) Less than 4P responses is scored 4. 

(b) Less than 2P responses is scored 6. 

(3) If the record has 26 or more R, 

(а) Less than 4P responses is scored 5. 

(b) Less than ар responses is scored 8. 
a (4) If the “animal” P response on card VIII 
15 missed, the score is 3. 

(5) If the "bat" p response 
Missed, the score is 3. 


on card V is 


Sex and Anatomy Content 


(1) 1-2 sex (includes embryo responses) or overt 
‘nal responses is scored 5. 

(2) 3 or more sex or anal responses is scored 0. 

(3) More than 3 anatomical responses is scored 
3 (does not include bonc responses, but does 
include pelvic responses). 


Other Content 

(1) If the "animal" percentage is higher than 
бо, the score is 8. 

(2) If the "animal" percentage is lower than 
30, the score is 4. 

(3) If 5 or more botany responses are present, 
the score ig 3. 

(4) If 4 or more landscape responses are pres- 
Cnt, the score is 3. 

(5) If 3 or more “food” responses are present, 
the score is 4. 
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(6) If less than 2 "human" responses are pres- 
ent, the score is 5. 

(7) If the number of part "human" responses 
(Hd) exceeds by 3 to 1 (and at least one H is 
present) the number of complete "human" re- 
sponses (H), the score is 5. 


Minus Cards 
If a card has at least 3 responses and the 
minus responses exceed the plus responses (no 
color responses count except FC minus) there 
is a score of 3, on each such card. 


Sex Confusion 
If a subject verbalizes obvious conflict about 
the sexual classification of a person or object 
(a) On at least two responses, the score is 4. 
(b) On three responses, the score is 5. 
(c) On more than three responses, the score 
is 8. 
Bizarre Logic 
(1) А contaminatory response is scored 20. 
(2) А definite pathological DW is scored 10. 


Preseveration 


If the same response is given on three suc- 
cessive cards and the response in each case is 
a minus of very poor quality, the score is 15. 

. . . 

The total maladjustment score is equal to 

the sum of the penalty weights given. 


APPENDIX VI 


SCORING FOR RORSCHACH LOOSENESS 


The following are the maladjustment signs 
«nd Corresponding weights) used in determin- 
NE the “looseness” score. 

Poor form quality (ie, low F+). — — 

Confused sequence in order of giving re- 
POnses, 


opie tamination or similar kinds of bizarre 
їс. 


Color Naming (CN). 


Very poor form quality responses. 
Poor quality sexual or anal responses, 
Poor quality movement responses. 
1 M minus response is scored э. 
2 M minus responses are scored 4. 
3 M minus responses are scored 8. 
4 M minus responses are scored 10. 
Perseveration of any of the kinds described 
is scored 15. 
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Loose Color or Color of Poor Quality 
d (C, CF—, FC—) 

(1) А pure color response (C) without any 
form is equal to a score of— 

(a) 3 on card X. 

(b) 2 on cards П or Ш. 

(c) 2 on card VIII. 

(d) 1 on card IX. 

(2) A color response with some minimum form 
but of poor quality (CF—) and containing either 
blood content, fire content or anatomy content 
gives— 

(a) A score of 3 on cards II, III, or IX. 

(b) A score of 4 on cards VIII or X. 

(3) Minus FC responses 

(а) One FC minus response equals a score 
of 2. 

(b) Two FC minus responses equal a score 
of 6. 

(c) Three or more FC minus responses equal 
a score of 8. 


Absence of Color 

(1) If color response involving a good integra- 
tion of form and color (FC+) 

(a) Is absent on cards If and III, the score 
is 6. 

(b) Is absent on cards VIII, IX, and X, the 
score is 9 (if FC given as last response on X, 
the score is 4). 

(2) No color in the total record gives the 
score of 4. 

(3) If there is only one FC in the total record 
and the rest of the color is C and CF minus, 
the score is 3. 


Color Naming (CN) 

(а) CN on one card equals a score of 12. 
nio CN on two or more cards equals a score 
of 16. 

Form Quality (F+) 

(1) If F+ is from 57 to 64 inclusive— 

(a) An R of 15 through 20 (if less than 15 R, 
at least 9 of responses should be in the F 
category) will give a score of 5 for an F% 45- 
55; and a score of 7 for an F% of 56 and over. 

(b) An R over 20 will give a score of 
4 for an F% of 35-45. 
6 for an F% of 46-55. 
8 for an F% of 56 and over. 

(2) If F+ from бо through 56— 

(а) An R of 13 through 20 will give a score of 
7 for an F% of 35-40. 

10 for an F% of 41-5o. 
12 for an F% of 56 and over. 

(b) An R over 20 will give a score of 
10 for an F% of 35-40. 

15 for an F% of 41-50. 
20 for an F% of 56 and over. 

(8) If F is under 5o— 


(а) An R of 10 through 13 will give a score of 
8 for an F% 0i-35-40. 
10 for an F% of 4-50. 
15 for an F% of 51 and over. 
(b) An R of 14 through 20 will give a score of 
12 for an F% of 35-40. 
18 for an F% of 41-50. 
25 for an F% of 51 and over. 
(c) An R over 20 will give a score of 
18 for an F% of 35-40. 
25 for an F% of 41-50. 
go for an F% of 51 and over. 


Very Poor Form Quality 
If a response is F minus, is a severe departurc 
in form from the card area used, and contains 
very peculiar or fantastic content (e.g., “people 
catching blood in a cup,” “dirty uterus,” “inside 
of a womb"), that response is scored 5. 


Number of Responses 

(1) If there are less than 20 responses, the 
score is 2. 

(2) If there are less than 15 responses, the 
score is 5. 

(3) If there are less than 12 responses, tha 
score is g. 

Average Time of Initial Responses (T/1R) 

(1) If the ТЛЕ is 25 through 35 seconds, the 
score is 2. 

(2) If the T/1R is 36 seconds or greater, the 
score is 3. 


Rejection of Cards 

(1) Rejection of card I or IV or VI or IX 
equals a score of 7. 

(2) Rejection of card II or III or V or VIT 
or VIII or X equals a score of 9. 

(3) Rejection of two cards in sequence results 
in an extra penalty of 5. (No more than two 
rejections, one for each of above categories, is 
counted). 

Shading Responses 


(1) 1 to 2 pure shading responses (Y) is scored 
2 


(2) 3 to 4 pure Y responses is scored 4. 

(3) 5 or more pure Y responses is scored 6. 

(4) 2 to 3 shading responses with little form 

(YF) is scored 2. 

(5) 4 or more YF responses is scored 5. 
White Space Responses (S) 

(1) 4 to 5 S responses is scored 2. 

(2) 6 or more S responses is scored 4. 

Popular Responses (P) 


(If less than 12 R in record do not score for 
P) 

(1) If the record has 12-15 R, 

(а) Less than 4P responses is scored 2. 


х 


єч 
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(b) Less than 2P responses is scored 4. 

(2) If record has 16-25 R, 

e Less than 4P responses is scored 4. 

(b) Less than 2P responses is scored 6. 

(3) If the record has 26 or more R, 

e Less than 4P responses is scored 5. 

(b) Less than 2P responses is scored 8. 
0 If the "animal" P response on card VIII 
15 missed, the score is 3. 
(5) If the "bat" P response on card V is 
missed, the score is $. . 


Sex and Anatomy Content 
a qe sex (includes embryo responses) or overt 
al responses is scored 5. 
(2) 3 or more sex or anal responses is scored 9. 
(3) More than 3 anatomical responses is scored 
Tes not include bone responses, but does 
е pelvic responses). 


Other Content 
n (1) If the "animal" percentage is higher than 
0, the score is 3. 
(2) If the "animal" percentage is lower than 
39 the score is 4. 
ü (3) If 5 or more botany responses are present, 
1€ Score is g. 
& (4) 1t 4 or more landscape responses аге pres- 
nt, the score is 3. 
m3À If 3 or more “food” responses arc present, 
€ score is 4. 


(6) If less than 2 “human” responses are pres- 
ent, the score is 5. 

(7) IE the number of part “human” responses 
(Hd) exceeds by 3 to 1 (and at least one H is 
present) the number of complete “human” re- 
sponses (H), the score is 5. 


Minus Cards 
If a card has at least 3 responses and the 
minus responses exceed the plus responses (no 
color responses count except ЕС minus) there 
is a score of 3, on each such card. 


Sex Confusion 
If a subject verbalizes obvious conflict about 
the sexual classification of a person or object 
(а) On at least two responses, the score is 4. 
(b) On three responses, the score is 5. 
(c) On more than three responses, the score 
is 8. 
Bizarre Logic 
(1) А contaminatory response is scored 20. 
(2) A definite pathological DW is scored 10. 


Preseveration 
If the same response is given on three suc- 
cessive cards and the response in each case is 
a minus of very poor quality, the score is 15. 
LJ . . 
The total maladjustment score is equal to 
the sum of the penalty weights given. 


APPENDIX VI 


SCORIN 


TS following are the maladjustment signs 

in, corresponding weights) used in determin- 
30 ‘looseness” score. 
Gan form quality (i.e. low FE. | 

spo nfused sequence in order of giving те- 
nses, 

gie mination or similar kinds of bizarre 


Color Naming (CN). 
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Very poor form quality responses. 
Poor quality sexual or anal responses. 
Poor quality movement responses. 
1 M minus response is scored 2. 
о M minus responses are scored 4. 
3 M minus responses are scored 8. 
4 M minus responses are scored 10. 
Perseveration of any of the, kinds described 


is scored 15. 
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e CHAPTER 1 


THE NATURE AND LIMITATIONS OF THIS STUDY 


A. THE NATURE OF THE STUDY 


" 
[= source of this investigation lies in 
the need in educational psychology 
for exactitude of concept formation. The 
need in the case of this present research 
arises from what seems to be a current 
trend—a trend illustrated by a recent 
text, which after presenting certain gen- 
eralizations concerning the oralsilent 
reading relationship, states that “the oral 
reading examination is indispensable to 
à thorough exploration of a student's 
reading abilities" (8, p. 238) The re- 
Search interest of such a statement lies 
in the fact that these writers do not offer 
experimental findings to establish their 
Contention, nor is supporting evidence 
at the academic levels for which they 
Write (high school and college) to be 
found in the literature. In fact, the only 
Justification for the statement which this 
Writer could discover is a conclusion by 
Swanson (who did not use such a test in 
his Study) that an oral reading test "such 
as Gray's Oral Reading Paragraphs" 
Should be useful at the college level (15, 
P- 59). A derived conclusion such as this 
latter would seem hardly justification for 
the twenty-seven pages which the text 
Quoted devotes to an analysis of various 
oral reading errors and their implications 
Or reading-diagnosis and for tutoring. 
Such generalizations concerning the 
diagnostic value of oral reading exam- 
Tis raise a problem of importance 
observable activity, and thus allows more 
Extensive Р 2: irecti and 
observation, direction, ап 
sue urement than do the more covert 
igh ud habits, the finding of a 
€gree of relationship between the 


two would lend logic to further examina- 
"tiom of the assumption that difficulties 
in the oral medium might be paralleled 
by similar difficulties in the silent 
medium, and that improvement in one 


„might result in. improvement in the 


other. Experimental verification of con- 
clusions such as that quoted in the open- 
ing paragraph, would provide workers 
in the field of reading difficulties with 
a useful instrument for reading diagnosis 
and for tutoring. : 

Outlined briefly below are those areas 
with which this present study will Ье 
concerned. 

e 
1. Studies of the Frequency 
of Oral Errors 


Such studies seem to be based upon 
the assumption that the student who 
makes the most frequent errors when 
reading orally thereby indicates his gen- 
erally low level of reading proficiency. 
Opposed to this notion, Marion Monroe 
(9) and Cassie Payne (11) several years ago 
asserted that both good and poor readers _ 
make all kinds of oral errors, scemingly 
indiscriminately, when level of reading 
proficiency is determined by some’stand- 
ard other than the action which is itself 
being observed—namely, the frequency 
of the making of oral errors. However, 
in spite of such objections, studies of 
error frequency haye continued to ap- 
pear with ever more detailed analyses 
of errors and of error positions. If oral 
errors do not show a substantial degree 
of correlation with other tests of lan- 
guage skills, the value of studies of oral- 
error frequency will be open to ques- 


tion. 
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2. The Relationship between Reading 
Comprehension and Knowledge of 
Sentence Elements as Indicated 
by Oral Phrasing 


Notions of such a relationship run 
like a thread through the literature. For 
example, one finds Sangren stating some 
years ago that "at times the teacher will 
ask the child to read orally and will dis- 
cover that he has no recognition of the 
sentence as а unit of thought . . .” (18, p. 
100). Gray tells us that “їп addition the 
examiner is often impressed with many 
otner defects in oral reading . . . and 
lack of proper phrasing" (6, p. 323). So 
also we have Thorndike emphasizing the 
"weighing of each of many elements in 
a sentence" (16, p. 329). And recently, 
Vera Paul has stated that it is lack Af 
proper phrasing which keeps many chil- 
dren from getting meaning from the 
sentence (10, p. 219). Such statements 
4re of interest in view of the fact that 
this writer has been unable to discover 
any published research findings to sup- 
nort them. 


3. The Relationship between Oral 
Errors and Vocabulary 


fhis relationship has not yet been 
investigated at the college level. Because 
of the substantial correlations found be- 
tween vocabulary and intelligence test 
scores (58 in this study), the finding of 
a high correlation between oral r 
errors and vocabulary, and between oral 
reading errors and intelligence test 


Scores would suggest certain interesting 
possibilities. 


eading 


B. Basic LIMITATIONS ОЕ THE Stupy 
A work of this nature has certain limi- 
tations which, while they are intrinsic to 
studies in this field, are sufficiently often 


ignored to Фе deserving of specific men- 
tion, 


ОЁ basic importance is the disagreement con- 
cerning the nature and exact definition of the 
proces which is central to our purpose—the 
reading proces. The particular viewpoint ES 
which one adheres would seem to be influential 
in determining the type of reading test which 
one will construct, and in determining the 
interpretations which will be placed upon the 
results of that test. The manner in which one's 
viewpoints may affect test construction and in- 
terpretation is illustrated aptly by a statement 
of Courtis, who some years ago argued that 
McCall's reading tests were in reality tests of 
thinking and not of reading. "My thesis is that 
since there are many types of reading situations, 
and it is necessary to restrict the term reading 
to one of them, the term should be used to 
connote the simplest form of the activity, namely, 
observational reading. In other words, reading 
to me means the degree of control possessed 
by the child over the reading mechanism, or 
to the excellence of the mechanism itself" (2, 
р. 291). There is no indication in the literature 
that Courtis has changed his position since that 
time. 

Holding to an Opposed view is the definition 
of reading in a recent text; “To summarize; 
reading is a psychological process in which the 
reader obtains meaning from printed words. . + * 
But clear images and the recognition of words 
are not enough; he must determine their mean- 
ing in a particular context. Interpretation of 
meaning involves memory of previous experi- 
ences, reasoning, and the reader's purposes and 
attitudes toward the situation as a whole" (8, 
р. 45). 

It may be gathered from the foregoing that 
the frame of reference within which an inves” 
tigator in this area will Operate is not a matter 
of common agreement, and hence, constitutes 
a basic limitation of such studies, It might be 
appropriate to add that Practically every form 
of behavior in this study—intelligence, spelling, 
vocabulary, reading rate, апа reading compre" 
hension—is а matter of considerable controversy’ 
and thus, exhibits similar limitations. 


C. SUMMARY 


This investigation, then, is based upo! 
these specific considerations: 

1. There is evident a current tendency 
to generalize broadly concerning thé 
values of oral reading tests for the diag- 


eS 


ih 
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nosis of silent reading difficulgies. 

2. The problem is created by the fact 
that there is little experimentation to 
justify such generalizations for the col- 
lege level. 


3. The goal of this present study is 
to furnish information in the light of 
which concepts of the diagnostic value 
of the oral reading test may be evalu- 
ated. 


CHAPTER II 


D 


THE STUDY GROUP AND INSTRUMENTS OF INVESTIGATION 


A. THE Stupy Group 


HE material for this study has been 
gathered from the scores of the 
eighty students whose grades placed 
them in the lowest academic quarter of 
a class of 312 freshmen at American In- 
ternational College for the second semes- 
ter ending in June, 1947. In all, 238 of 
these 312 were completely tested. 
As may be noted from Table 1, range 
and variance in age and length of time 


Since participation in the study was not 
voluntary on the students' part, and since 
the students as a group, in activities quite 
removed from this study, evidenced the 
emotionality indicative of feelings of 
personal insecurity, it was deemed in- 
advisable to inquire into data of such 
a nature as would allow the classifica- 
tion of the students into one of the 
schema of socio-economic status. 

Of interest, also, is the extent of stu- 


TABLE 1 


АСЕ, LENGTH or TIME BETWEEN LAST SCHOOL ATTENDANCE 
AND COLLEGE, AND INTELLIGENCE 


= 


N Mean sS T Range 
|. POP = 
Age (in Months) 8o 275.95 59.67 215-420 
Time between Last School Attendance and College En- 
trance (in Months) 8o 34.64 26.25 3-147 
American Council on Education Psychological Examina- 
tion, 1946 Edition (Raw Scores) 8o 05.39 22.83 53-163 


between last school attendance and col- 
lege entrance are considerable. Until 
clarification of the subject is available, 
the influence of these factors upon read- 
ing proficiency is largely conjectural. 

То summarize other descriptive data, 
the group of eighty students upon whose 
performance this study is based con- 
sisted of 74 males and of six females. Of 
the men all but fourteen were veterans. 
Of the women two were veterans. 
Seventy-five of the eighty had been ad- 
mitted to the college by certification, two 
had been admitted by entrance exam- 
ination, and three had been admitted 
on the basis of the United States Armed 
Force Institute Examination. 

Data on the language backgrounds of 
the sample are presented in Table ә. 


dent oral reading, since this might pos- 
sibly affect accomplishment on a test of 
oral reading. Of the 303 members of 
this freshman class, who returned conr 
pletely filled questionnaires, 260 stated 
that they read orally one hour a week or 
less. From the forty-three who claimed 
to read orally from one to fifteen hours 
a week, inquiry elicited the information 


TABLE 2 
LANGUAGES SPOKEN IN THE HOMES OF STUDENTS 
N=8o 

Language N 
Sua gro MS 1 
ere 41 

ICE 5 

c UMEN и а qnae id A 1 
Italian. me 14 
BOIS еа анне on mer te 
Е D 8 


| 


“ 
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that several were membersgof a radio 
work-shop, and that mapy of the others 
were engaged in various forms of com- 
munity group work which necessitated 
considerable oral reading. 


B. THE DEVELOPMENT OF AN ORAL 
INSTRUMENT FOR INVESTIGATION 


1. The Necessity for a New Oral 
Reading Test 


For the purposes of a study of oral 
reading at the college freshman level, 
there are available only two oral reading 
tests—a sub-test to be found in Strang's 
Examiners Diagnostic Reading Record 
(14) and a sub-test of the Diagnostic 
Reading Tests, prepared by a committee 
under the chairmanship of Triggs (18). 
Because of the grade range for which 
these tests were developed, and because 
of their limited range of material at the 
college level, neither of them was con- 
Sidered suitable for use in the present 
study, Determination of the diagnostic 
Usefulness of a test of oral reading for 
college freshmen would seem to demand 
the construction of an oral reading in- 
Strument for that purpose. 


2. Construction of the Oral Reading 
Test 

Since reading disability may be specific 
to certain types of material, the content 
of the reading passages should be func- 
Чопа], ie, should sample the fields of 
reading in which a college freshman en- 
Sages in his academic work. For the 
Freshman Oral Reading Test? the fields 
Of reading which were chosen, basing the 
Choice upon the program of general 
Studies for the freshman year at Ameri- 
2 International College, were psy- 
9:95, history, the natural sciences, and 


m v 


Will dy 


pe oral instrument devised for this stu 
© referred to hereafter by this title. 


English. The reading passages were se- 

lected from the following sources: from 

freshman texts which are no longer pub- 

lished because of text revisions, from the ` 
Sunday edition of the New York Times, 

and from sophomore texts. 

Comprehension questions were devised 
for the reading passages, and these ques- 
tions together with the selections were 
subjected to expert criticism and to re- 
vision. In its revised form the test was 
administered to a preliminary group of 
eighty students. Ап item analysis of the 
comprehension questions was based upon 
the performance of this preliminary 
group. Techniques used for the item 
analysis were those suggested by Davis 
(8, pp. 6-7), and the difficulty and dis- 
crimination indices for the comprehen- 
sion questions retained for the experi- 
mental form of the test are to be found 
in Tables 3 and 4. 

The test was then administered to the 
eighty members of the study group. Re- 
liabilities between Forms A and B, ob- 
tained from this administration, were: 
for time of oral reading, .go; and for 
oral comprehension, .67. 

Based upon experience with the pre- 
liminary group, the following oral read- 
ing errors were recorded on the experi- 
mental administration of the test: 

a. Mispronunciations—in cases in 
which the mispronunciation was not 
clearly owing to substitutions, omissions, 
or insertions within the word. 

b. Punctuation Errors—in cases in 
which change of voice did not indicate 
heed for the punctuation mark. It was 
[felt that punctuation errors would con- 
stitute a useable device for approximat- 
ing an objective measure of oral phrasing 
and rhythm. 

c. Omissions—in cases of omission both 
of words and of word parts. 
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TABLE 3 
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© 


PERCENTAGE OF SUCCESSES IN THE UPPER 27% AND LOWEST 
27% OF THE PRELIMINARY GROUP* 


N=80 
Form A 
L 1-2 1-3 I4 II-1 II-2 II-3 Ila 
A 34 78 78 68 5° 49 55 
8 5 41 44 21 ІІ 6 15 
II- III- IV-1 IV-2 IV-3 V-x V-2 VI 
ба P 67 61 64 83 83 73 
5 58 17 28 18 28 48 25 
VI-2 VII-r VII-2 VII-3 VII-4 VIII-: VIII-2 VIII-3 
54 51 31 43 83 83 97 43 
7 20 I 7 31 35 68 2 
Form B 
I-: 1-2 1-3 Il-r 11-2 II-3 ПІ-1 III-2 III-3 
52 68 93 68 70 77 бо 86 36 
20 3o 31 25 37 34 16 55 1 
III-4 Ш-$ IV-1 IV-2 У.т V-2 V-3 VI-: VI-2 
46 74 86 бо 48 84 8o 38 72 
16 43 22 22 II 54 II 12 28 
VI-3 VII: VII-2 VIII-1 VIII-2 VIII-3 VIII-4 VIII-s 
87 67 8 52 38 76 7 52 
38 4 43 20 12 25 34 17 


* Of each pair of numbers the peper gives the 
and the lower the percentage for 


d. Substitutions—in cases of substitu- 
tions of both words and of word parts. 

e. Repetitions—including any number 
of repetitions of a word. Even a single 
repetition may be indicative of oral dif- 
ficulties. 

f. Insertions—in cases of insertions of 
both words and word parts. 

8. Hesitations—A liberal allowance of 
three seconds was placed upon the allow- 
able limits of a hesitation in order to 
allow for variations in breath control, 
attention span, and so on. 

h. Word Stress—While it is by no 
means clear that this error would be of 
extended significance in a consideration 
of reading disability, since its probable 
origin may lie in the world's being more 
a part of the reader's reading vocabu- 
lary, than it is of his speaking vocabu- 
lary, the practice occurred a sufficient 


e lowest 27%, 


rcentage of successes for the upper 27% of the group 


number of times to justify its inclusion 
as an error to be scored on the experi- 
mental form of the test. 

As a device for choosing the oral errors 
which were to be used in the research, 
scatter diagrams of oral error perform- 
ance were plotted. The essential material 
from these scattergrams is presented in 
Table 5. 

It was decided arbitrarily that if over 
fifteen percent of the sample obtained 
а zero error score on any specific error, 
this error would not be Suitable for 
inclusion in the study. As may be noted 
from Table 5, Mispronunciations, Sub- 
stitutions, and Omissions warrant inclu- 
sion. Word Stress errors, Hesitations, and 
Repetitions do not constitute errors use- 
ful for our Purposes. Insertions and 
Punctuation Errors are of questionable 
value, but will be included. 
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TABLE 4 


o 
Davis's DISCRIMINATION AND DIFFICULTY INDICES FOR THE COMPREHENSION 
QUESTIONS OF THE FRESHMAN ORAL READING TEsT* 


N=80 
Form A 
I: 1-2 1-3 1-4 Il-1 II-2 II-3 III-1 
47 зо 24 23 31 30 38 30 
44 32 55 56 47 40 38 42 
III-2 III-3 IV-r IV-2 IV-3 V-1 V-2 VI-1 
49 42 35 21 kH 38 24 33 
42 66 46 47 45 53 58 49 
VI-2 VII-1 VII-2 VII-3 VII-4 VIII-1 VIII-2 VIII-3 
39 22 45 36 35 33 35 45 
40 42 36 36 54 54 69 34 
Form B 
I: I-2 1-3 II-1 II-2 II-3 III-1 III-2 III-3 
22 24 48 28 21 28 3ї 24 48 
42 49 56 48 52 53 44 бт 3r 
HI-4 111-5 IV-1 IV-2 V-1 У-2 У-3 VI-1 VI-1 
22 21 45 25 зо 22 50 21 22 
40 54 52 43 38 6o 48 36 53 
VI-3 VII-1 VII-2 VIII-1 уш? VIII-3 VIII-4 VIII-5 
35 53 30 22 2I 34 28 25 
56 42 58 42 36 51 33 42 


* Of each pair of numbers the discrimination index is the upper, the difficulty index is the lower. 


C. Tests or SILENT READING SKILLS 

The guiding considerations in the 
choice of tests of silent reading and of 
intelligence with which to correlate the 
Freshman Oral Reading Test were: (1) 
that they should be of college level, (2) 
that there should be information in the 


literature as to their reliabilities and 
usefulness, and (3) that their administra- 
tion and scoring should not be so com- 
plex as to render their use impracticable 
under the conditions in which this in- 
vestigation was conducted. 

In the light of these considerations 


TABLE 5 


Error PERFO! 


RMANCE OF THE SAMPLE ON FORMS А AND В 
or rHE ORAL READING TEST 
N 


=8o 
Mean e Du Мова r between 
Forms A orms orms ‘orms 
Type of Oral Error ОВ and B ond B EROS 
"—— M e sl 
Word Stress .8 1.1 .9 UII 
Hesitation 1.3 637. ne +30 
unctuation Errors 3-5 4.2 +2 Er 
Insertions 1.9 ae 225 .29 
Substitutions 3.0 25 8 3. .59 
missions 3.8 16 d ES 
ispronunciations 5.2 em 1: . 
epetitions 3.6 4.5 4 -70 
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the following examinations were selected. 

1. Intelligence—American Council on 
Education Psychological Examination 
for College Freshmen, 1946 Edition. 

з. Vocabulary—Cooperative English 
Test С 2: Reading Comprehension 
(Higher Level), Form Q, Part 1. 

3. Rate and Level of Comprehension 
—Cooperative English Test C 2: Reading 
Comprehension (Higher Level), Form 
О, Part II. 

4. Speed of Reading—Minnesota Speed 
of Reading Test for College Students, 
Form B. 

5. Sentence Structure—The Purdue 
Placement Test in English for High 
Schools and Colleges, Form A, Part IV. 

6. Grammatical Classification—The 
Purdue Placement Test in English for 
High Schools and Colleges, Form л, 
Part II. 


7. Recognition of Grammatical Errors 
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—The Purdue Placement Test in Eng- 
lish for High Schools and Colleges, Part 
III. 


D. INTERCORRELATIONS OF THE SILENT 
READING AND INTELLIGENCE TESTS 
In order to determine whether or not 

these tests of language skills and of in- 
telligence could be presumed to measure 
relatively independent abilities, perform- 
ance on them by the students for whom 
complete results were obtained was inter- 
correlated. Table 6 presents the results 
of such intercorrelation. All correlations 
are of raw scores. 

The list preceding Table 6 identifies 
the numbers used for coding the lan- 
guage tests included. The means and 
the standard deviations presented at the 
foot of each column are these measures 
for the scores obtained on that test the 
code number for which stands at the 
head of the column. 


TABLE 6 


INTERCORRELATIONS OF THE CRITERION TESTS 
N=238 
1. Cooperative English Test C 2, Vocabulary Score 
2. Cooperative English Test C 2, Speed of Comprehension 


3. Cooperative English Test C 2 
4. Cooperative English Test C 2, Total Score 
5. Minnesota Speed of Reading, Form B 


, Level of Comprehension Score 


6. Purdue Placement, Grammatical Classification Score 


7. Purdue Placement, Recogn 


ition of Grammatical Errors Score 


8. Purdue Placement, Sentence Structure Score 


9. A.C.P.E., “Q” Score 
10. A.C.P.E., “L” Score 
тт. A.C.P.E., Total Score 


I 2 3 4 5 6 7 8 9 то II 

I . 844 -596 -840 -432 -385 .266 274 239 . 692 .580 

2 .818 -985 542 212 +315 +383 -396 .712 .675 

3 -852 | .455 355 240 | .328 | .394 | .583 629 

4 :499 | -393 | .296 | .353 | .доз| .678 .661 

5 143 +251 +264 +339 +472 -490 

6 -494 | .366 | .199 | .8rr .386 

7 -238 | .135 | .344 321 

129 300 -354 

9 524 .817 

то ‚874 

II 

M. [20-9 17-4  (r1.6 моо ar тт. 19.5  |16.7 4 68.0  |108.9 
S.D. | 9-9 10.6 7-4 [25.5 5.2 5.1 9.2 4.8 3 8.5 23.1 


CHAPTER III 


PRESENTATION OF THE FINDINGS 


° 
А. GUIDING CONSIDERATIONS 
T critical ratio is used as a test of 
the significance of the 77% presented 
in this section. The convention which 
has been followed is that ratios between 
1.96 and 2.58 may be regarded as being 
significant, and that those ratios which 
àre larger than 2.58 are to be regarded as 
being very significant. Kelley's "one-third 
Sigma rule" has been followed in re- 
porting the findings of the study (7, 
P. 223). 
B. A Srupy or THE ORAL ERROR SCORES 
The following Tables, 7 to 14, present 
the results obtained from correlating the 
oral errors—Mispronunciations, Substitu- 
tions, Omissions, Insertions, and Punctu- 
ation Errors, as well as scores of oral 
comprehension, total oral errors, and 
time of oral reading—with the silent 
reading measures used in the investiga- 
tion. These oral test scores are based 
upon the use of both Forms A and B of 
the Freshman Oral Reading Test con- 
Sidered as if they together constituted 
but one examination. 


1. Omissions and Punctuation Errors 
As is to be noted from an inspection 


TABLE 7 


ConRELATIONS OF OMISSIONS Errors 
with OTHER Test RESULTS 


— 


re 


= ‘Test 

Cooperative Speed of Comprehension mer 
operative Level of Comprehension а 
operative Vocabulary i 

М а Total f Readi д 

sota Speed of Reading : 

urdue Sentence Structure 98 
urdue Recog. Gram. Errors 09 
urdue Gram, Class. po 
Merican Council Psychological 3 
9tal Oral Comprehension m 


* я я 5 
ig ene of the r's are significant at the 5% 
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of Tables 7 and 8, neither the omission 
of words or word parts nor the extent of 
disregard for punctuation marks during 


TABLE 8 


CORRELATIONS OF PUNCTUATION ERRORS 
WITH OTHER Test RESULTS 


Test #* 
Cooperative Speed of Comprehension .09 
Cooperative Level of Comprehension -04 
Cooperative Vocabulary .02 
Cooperative Total .03 
Minnesota Speed of Reading ET 
Purdue Sentence Structure II 
Purdue Recog. Gram. Errors +03 
Purdue Gram. Class. —.04 
American Council Psychological .07 


Total Oral Comprehension — .35 


* None of the r's are significant at the 5% 
level. 
oral reading are related significantly to 
any of the silent tests of language skills 
used in this study. This is a finding of 
interest since the Punctuation Error 
score was intended as an objective meas- 
ure of phrasing during oral reading. 


2, Insertions 


The insertion of added sounds and 
words during oral reading correlates sig- 
nificantly with the results of but one 
test, the Cooperative Reading Vocabu- 
lary Test (Table 9). 


3. Substitutions 


The relationships obtained between 
this type of error—the substitution of 
words or word parts during oral reading 
—and the silent tests were of interest to 
this writer in view of the importance 
given to this error in several of the 
studies of the value of oral reading. How- 
ever, as is evident from Table 10, Sub- 
stitutions are correlated significantly 
with none of the criterion scores except 
with those of the Purdue tests of Gram- 
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TABLE 9 


CORRELATIONS OF INSERTIONS ERRORS 
WITH OTHER TEST RESULTS 


* TABLE ro 


CORRELATIONS OF SUBSTITUTIONS ERRORS 
WITH OTHER TEST RESULTS 


Test r ar 
Cooperative Speed of Comprehen- 
sion SEDES. 
Cooperative Level of Comprehen- 
sion «X9 | 222 
Cooperative Vocabulary 227 |. .x0* 
Cooperative Total Жо | Xm 
Minnesota Speed of Reading «18 || a 
Purdue Sentence Structure аа |) coer 
Purdue Recog. Gram. Errors 203 | XY 
Purdue Gram. Class. SET || aL 
American Council Psychological -06 | „ІІ 
Total Oral Comprehension | +05 |o 


Test r or 
Cooperative Speed of Comprehen- 
sion r | ай 
Cooperative Level of Comprehen- 
sion £418 | оят 
Cooperative Vocabulary 0 | sim 
Cooperative Total .o08 | „ІІ 
Minnesota Speed of Reading t$ | „1% 
Purdue Sentence Structure —.06 | .11 
Purdue Recog. Gram. Errors sai | s 
Purdue Gram. Class. ‚2@ | «її 
American Council Psychological 0$ | aT 
Total Oral Comprehension —.07 II 


Рот. 


matical Classification апа Recognition 
of Grammatical Errors. 


4. Mispronunciations 


Of the errors studied in this investig.- 
tion, Mispronunciations (Table 11), con- 
sisting of gross errors in the pronuncia- 
tion of words, are correlated with more 
of the tests of silent reading skills than 
are any of the other oral error scores used 
in the study. Such errors show a signifi- 
cant correlation with the tests of the 
level of silent reading comprehension, 
vocabulary, and the psychological exam- 
ination. To be noted is the fact that 


TABLE її 


CORRELATIONS OF MisPRONUNCIATIONS ERRORS 
WITH OTHER TEST RESULTS 


Test r or 

Cooperative Speed of Comprehen- 

sion 2ї її 
Cooperative Level of Comprehen- 

sion . +29 | .1of 
Cooperative Vocabulary -42 | .ogf 
Cooperative Total :38 | „тоў 
Minnesota Speed of Reading sI2 xr 
Purdue Sentence Structure +03 | „ІІ 
Purdue Recog. Gram. Errors ло | ¿zy 
Purdue Gram. Class. +18 | тї 
American Council Psychological +29 | .xof 
Total Oral Comprehension 225 |! pg 

* o5 >P>.01. 

t Р «.or. 


* 05 P. or, 


there is a significant difference in the 
size of the correlation between this error 
and vocabulary compared to the size of 
the correlation between Mispronuncia- 
tions and level of silent comprehension 
(12, p. 185). With regard to the latter 
the correlations between Mispronuncia- 
tions and level of silent comprehension 
and mispronunciations and oral compre- 
hension show no significant difference.’ 

In view of the size of the correlation 
between Mispronunciations and the psy- 
chological examination, the problem of 
the possible role played by mental ability 
in the relationship observed between 
Mispronunciations and the other vart 
ables is relevant. Holding intelligence 
constant (12, p. 243), significant partial 
7s are obtained between Mispronuncla- 
tions and vocabulary, and between Mis- 
pronunciations and oral comprehension.” 


5. Oral Comprehension 


Oral comprehension, as measured by 


1 
Tiz — fis = 04; Ori; — Тз = 12 


° Mispronunciations and Level of Silent Сол!" 
prehension: 


á Тоз.а = .14; Ооа = «11 
Mispronunciations and Vocabulary: 
б To1.2 = .22; Ото 2 = 10 E 
Mispronunciations and Oral Comprehension: 
Тоз = .18; ото. = .08 
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TABLE rz е 


CORRELATIONS OF ORAL COEPREHENSION WITH 
OTHER Test RESULTS 
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TABLE 13 


CORRELATIONS OF ToTAL ORAL TIME OF READ- 
ING WITH OTHER TEST RESULTS 


Test r or 

Cooperative Speed of Comprehen- 

sion — .42 | .09Ї 
Cooperative Level of Comprehen- 

sion .39 | -1of 

ooperative Vocabulary .38 | - rol 

‘cooperative Total -45 | -oof 

Innesota Speed of Reading .08 | „ІІ 

urdue Sentence Structure .17 | -11 
Purdue Recog. Gram. Errors .18 | 2¥ 
E urdue Gram. Class. .13 | 11 
American Council Psychological 25 ir 

* .05>P>.01. 

1 P €.o1. 


the correctly answered check items of the 
Freshman Oral Reading Test, is related 
Significantly with the three part scores 
апа the total score of the Cooperative 
English Reading Comprehension Test 
and with the psychological examination 
(Table 1з). With mental ability held 
Constant, the resulting partial 1's of these 
Variables are significant? 


6. Time of Oral Reading 


The results of the correlation of this 
Oral score with the other measures used 
in the study are presented in Table 13. 
Ot interest is the observation that there 
is not a significant difference between 
the correlations of Time of Oral Read- 
ing and time of silent reading as meas- 
Ured, on the one hand, by the Coopera- 
tive Speed of Comprehension score and, 
Оп the other hand, by the Minnesota 
Speed of Reading results.’ 
enen prehension and Speed of Silent Com 
mara Comp O Level of Silent Com- 

ehension: 


Tor.2 = .31; OFo1.2 = +10 
Oral Comprehension and Vocabulary: 


О: Тоз = .30; Ofor.2 = +10 š 
Scoren Comprehension and Cooperative Total 


To1.2 = .40; OTos.2 = +10 


4 
Та =. 
12 — fu = 05; Ofn — Tis = 10 


Test r or 

Cooperative Speed of Comprehen- 

sion .38 | „Іоў 
Cooperative Level of Comprehen- 

sion 10 || т 
Cooperative Vocabulary 25 | .xx* 
Cooperative Total +31 | лоў 
Minnesota Speed of Reading .33 | -1of 
Purdue Sentence Structure +03 | „11 
Purdue Recog. Gram. Errors 307 | «II 
Purdue Gram. Class. 06 | „11 
American Council Psychological ло | її 
Oral Comprehension .оо | „оо 

* os» P ».ort. 

1 P <.or. 


7. Total Oral Errors 


As with Substitutions so with Total 
Oral Errors we have a score which is of 
Zaterest in view of the fact that its rela- 
tionships have constituted one of the 
major findings of several related studies. 
For example, Gilmore, who used an oral 
test of the same general type as that of 
this investigation, with a group consist- 
ing of practically all of the students in 
the first eight grades of one school sys- 
tem, found a mean т of .67 between total 
errors and silent reading comprehension 


TABLE 14 


CORRELATIONS OF TOTAL ORAL ERRORS WITH 
OTHER TEST RESULTS 


Test f or 

Cooperative Speed of Comprehen- 

sion | жай, 
Cooperative Level of Comprehen- 

sion .24 | .тї* 
Cooperative Vocabulary .34 | лоф 
Cooperative Total ` .29 | .1of 
Minnesota Speed of Reading .18 | .1r 
Purdue Sentence Structure .08 | „Іт 
Purdue Recog. of Gram. Errors .23 | .тї* 
Purdue Gram. Class. P I5 | .тт 
American Council Psychological .20 | .1x 
Total Oral Comprehension ло | viz 
Total Oral Time .38 | лоў 

* og» P 2.01. 

1 P <.or. 
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for the eight grades (5, p. 77). Swanson, 
who worked with two groups of college 
freshmen, one of poor and one of good 
readers, and who recorded their oral 
reading of three paragraphs, found an'r 
of .44 between total oral errors and silent 
reading comprehension (15, p. 52). In 
contrast, in this present study, which is 
based upon the use of an oral reading 
test with eighty college freshmen of low 
academic achievement, the correlations 
between Total Oral Errors and both oral 
comprehension and the psychological 
examination are not significant at the 
5% level of confidence, although they 
are almost of a size to be acceptable, .20 
and .19. Nor is this error score corre- 
lated significantly with speed of silent 
comprehension, nor with the tests of 
sentence structure, except for the Purdue 
test of Recognition of Grammatical Er- 
rors. Total Oral Errors are related equal- 
ly with level of silent comprehension and 
with vocabulary.5 This score is signifi- 
cantly correlated, as well, with time of 
oral reading. As may be noted from 
Table 14 the significant correlations are: 
with level of comprehension, .27; with 
vocabulary, .34; with the Cooperative 
total score, .29; with recognition of gram- 
matical errors, .23; and with time of 
oral reading, .98. 

The size of these correlations when 
compared to those obtained by Gilmore 
(Table 15) leads to a consideration of a 
probable relationship between Total 
Oral Error Scores and grade level. 

According to the tendency evidenced 
by Gilmore's findings (Table 15), one 
might well expect to find a correlation 
between Total Oral Errors and silent 
reading comprehension which would be 
lower at the college freshman level than 


ry — Tg = 07} GT — Ta = .06 


the т of .5Š found by Gilmore at the 
eighth grade. Yhat Gilmore may have 
suspected the existence of such a trend 
is suggested by his comparison of his 
findings with the r of .44 obtained by 
Swanson between the same variables, 
and his conclusion that oral reading hab- 
its are more closely related to silent 
reading comprehension at the elemen- 
tary than at the college level (5, p. 154). 
Regressing Gilmore's correlations upon 
their grade levels, and weighting the 
coefficients in order to provide for dif- 
ferences in grade size (7, p. 352), yields 
a regression coefficient for correlation co- 
efficient regressed on grade of —.0481. 
Е, х2) equals 8.3014 (7, p. 358), t equals 
2.88 (7, p. 357), and thus, we have а 
basis for rejecting the hypothesis that the 
differences in size to be observed among 
the correlations obtained by Gilmore are 
ascribable to chance deviations from a 
true correlation of zero between coeffi- 
cient of correlation and grade. A sug- 
gested corollary would seem to be the 
decreasing value of a test of oral reading 
as a source of diagnostic clues concern- 
ing the nature of silent reading compre 
hension, as grade level advances. 


TABLE 15 
CORRELATIONS AT GRADES II THROUGH VIII BE- 
TWEEN ORAL READING SCORES AND COMPRE- 
HENSION SCORES ON THE STANFORD TEST 
(5, р. тто)* 


Correlation between 
Oral Reading Errors 
and Comprehension 
Scores on the Stanfor 
Test (Part I) 


Grade 


918 
631 
eaae mas OOF 
vica d ss 693 
SEERA ‘CBO 


* N=138 for Grades II and III, N=184 for 
Grades IV, V, and VI, and N=138 for Grade? 
VII and VIII. 


* 
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e CHAPTER IV 


EVALUATION OF FINDINGS AND SUMMARY 


A. EVIDENCE or THE DIAGNOSTIC VALUE 
or SPECIFIC TYPES OF 
READING ERRORS 


F THE oral errors used in this study 

Omissions and Punctuation Errors 
are related to none of the silent language 
tests, Insertions are correlated .27 with 
vocabulary, Substitutions errors are re- 
lated with but two of the silent tests— 
-22 with the Purdue Test of Grammati- 
cal Classification, and .22 with the Pur- 
due Recognition of Grammatical Errors. 
The M ispronunciations error, considered 
from the viewpoint of its diagnostic use- 
fulness, as determined by the number of 
tests with which it is correlated, olfers 
the most promise. Its correlation with 
oral comprehension is .25; with silent 
Comprehension .29, and with vocabu- 
lary is .42. 

These figures would seem to suggest 
that gross mispronunciations during oral 
reading are related to some extent with 
the vocabulary level of the reader. How- 
ever, it is difficult to conceive of a man- 
ner in which these oral errors might re- 
veal information about a student's vo 
Cabulary difficulties which could not be 
More easily and directly obtained from 
а vocabulary test itself. . 

In general, it may be said that in view 
9f the possible number of significant cor- 
relations which might have been ob- 
'ained between specific types of oral 
error and tests of language skills, the 
number actually obtained is extremely 
Small. This finding, when taken in con- 
Junction with Gilmore's discovery at the 
clementary level that of the seven types 
of oral errors which he studied, only two 
Substitutions and Words Pronounced 


by the Examiner—are significantly re- 
lated to silent reading comprehension 
5, рр. 145-6), would suggest that spe- 
cific types of oral error are of but in- 
different value for the furnishing of clues 
in the diagnosis of reading difficulties, 
either oral or silent. This would further 
suggest that such analyses of specific oral 
errors and their diagnostic significance 
as those of McCullough, Strang, and 
Traxler (8, pp. 246-273) have little sound 
basis. 


B. Tur RELATIONSHIP OF ORAL PHRASING 
AND READING COMPREHENSION 


_ The Punctuation error, consisting of 
"the student's failure to change the voice 
appropriately, was intended as an objec- 
tive measure of the student's oral phras- 
ing. In lieu of some such measure it 
would seem that estimates of the sub- 
ject's skill in oral phrasing would tend 
to be somewhat subjective. However, 
insofar as Punctuation Errors served this 
purpose, “no evidence was furnished by 
this study to indicate that phrasing dur- 
ing oral reading is related to either silent 
or oral comprehension. One would not 
wish to push the suggested conclusion 
overly far in view of the suspected inade- 
quacy of the Freshman Oral Reading 
Test as a measure of this error. How- 
ever, the logic of such a conclusion 
would be supported by the not infre- 
quent observation of students who, to 
all appearances, read quite well orally 
with little or no comprehension of the 
material read. (In this connection we 
might cite the correlations of .27 be- 
tween Total Oral Errors and level of 
silent reading comprehension, and .19 
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between Total Oral Errors and Oral 
Comprehension.) 


C. THE RELATIONSHIP OF MENTAL 
ABILITY TO THE ORAL AND 
SILENT VARIABLES 


The influence of mental ability upon 
the relationships between the oral and 
silent scores has been determined by 
holding mental ability constant and test- 
ing the significance of the consequent 
partial r’s. In only one case, that of the 
correlation between Mispronunciations 
and level of silent reading comprehen- 
sion, was the resulting partial r non- 
significant. This would suggest that al- 
though mental ability is a factor in the 
oral-silent reading relationship, this re- 
lationship may be explained in terms 
other than those of the influence of ¢ 


factor of mental ability common to the 
two variables. 


D. THE RELATIONSHIP OF ORAL READING 
TO MASTERY OF SENTENCE ELEMENTS 


The two instances of relationship be- 
tween oral errors and tests of mastery 
of sentence structure have been men- 
tioned previously. These are the corre- 
lations of .22 between Substitutions and 
grammatical classification, and of .22 be- 
tween Substitutions and recognition of 
grammatical errors. These r’s would seem 
to indicate an extremely slight tendency 
among freshmen of low academic 
achievement who are uncertain about the 
structural components of an English 
sentence to betray that uncertainty by 
the making of substitutions in their oral 
reading. 

Of seemingly greater importance is the 
failure in this study to find a significant 
correlation between tests of mastery of 
sentence elements and oral comprehen- 
sion. If this relationship were to be 


found, one;implication might be the 
indicated value of instruction in the 
elements of sentence structure as a part 
of a program of instruction in reading. 
Our findings would not seem to serve to 
justify this suggestion. 


E. THE RELATIONSHIP OF ORAL READING 
WITH VOCABULARY 


The correlation between Mispronun- 
ciations and vocabulary is .42; between 
Time of Oral Reading and vocabulary 
it is .34. Between Oral Comprehension 
and vocabulary the r is .38, which is not 
significantly different from the r of the 
same oral variable with level of silent 
reading comprehension. This would sug- 
gest that oral reading comprehension is 
related about equally both with silent 


reading comprehension and with vocabu- 
lary. 


Е. THE RELATIONSHIP OF ORAL READING 
COMPREHENSION TO SILENT READING 
COMPREHENSION 


Oral reading comprehension is, of the 
oral variables studied, related most sub- 
stantially to silent reading comprehen- 
sion. The ? between Oral Reading Com- 
prehension and the total score of the 
Cooperative English Reading Compre- 
hension Test C 2 is .45. (This latter score 
15 à composite score representing per- 
formance on tests of speed of silent read- 
ing comprehension, level of silent read- 
ing comprehension and vocabulary.) In 
view of the possible effects of differing 
methods of testing reading comprehen- 
sion upon the rs obtained between two 
tests, which have been pointed out by 
Eurich (4, РР. 404-6) and Tinker (17, 
PP- 217-18), we may be justified in re- 
garding this r as being a somewhat too 
low indication of the true relationship 


existing between the two variables, Con- | 


\ 
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Sequently, it would seem safe to suggest 
that this r indicates that college fresh- 
men of low achievement who understand 
well what they read orally tend also to 
understand well what they read silently. 

It may be remarked, however, that this 
observation will be hardly surprising to 
those who work in the field of reading 
difficulties, and would seem to have little 
bearing upon the problem of whether or 
not difficulties in word attack during 
oral reading are indicative of similar 
difficulties in word attack during silent 
reading. 


G. THE RELATIONSHIP OF TIME OF ORAL 
READING TO TIME OF SILENT READING 


Time of Oral Reading is correlated 
significantly with speed of silent compre- 
hension, the r being .45. It is not sig- 
nificantly correlated with level of silent 
comprehension. With Total Oral Errors, 
Time of Oral Reading is correlated .38, 
and with vocabulary, .25- The coefficient 
of .38 between Time of Oral Reading 
and Total Oral Errors may be compared 
to the r of .56 found by Swanson (15, P- 
46). These findings would seem to indi- 
Cate a degree of relationship between 
Speed in silent reading and speed in oral 
reading. (However, any consideration of 
ап application of this finding would in- 
Volve, as well, the observation that Time 
of Oral Reading is not significantly cor- 
related with Oral Comprehension.) 


Н. Tur RELATIONSHIP OF TOTAL ORAL 
Errors ro THE LANGUAGE TESTS 

This error score, which formed the 
main oral measure of Gilmore's study of 
a general elementary school sampling, 
Showed lower evidence of relationship at 
the college freshman level with students 
of low academic performance. For ex- 
ample, significant correlations were not 
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found between this error and mental 
ability for the freshman group of this 
study. The correlation of Total Oral 
Errors with level of silent reading com- 
prehension in this present study is .27, 
and between this error and vocabulary 
is .34. These two correlations are not 
significantly different in size. 

These 175 of .27 and .34 are somewhat 
lower than the r of .56 which Gilmore 
found between total oral errors and 
silent reading at the eighth grade. How- 
ever, as has been seen from an examina- 
tion of Gilmore's т'5 for these two vari- 
ables from grades two through eight, 
there is a steadily decreasing trend in the 
size of the r's as the grade level ad- 
vances. The hypothesis has been tested 
that the differences to be observed in the 
size of the correlations obtained by Gil- 
‘more represent chance differences from a 
true correlation of zero between correla- 
tion coefficients and grade level, and 
it has been observed that Gilmore's 7's 
are significantly related to their grade 
levels. It seems reasonable, consequently, 
to infer that the lower 7's found in this 
present investigation between Total Oral 
Errors and silent reading comprehension 
represent correlations of such a size as 
might be expected in the light of the 
decreasing size of the 7's between total 
oral errors and silent reading compre- 
hension as the grade level advances. 

This observation leads to certain 
further considerations. The first of these 
is that since the shift from oral to silent 
reading is made in the second grade, it 
would not seem to be illogical to pre- 
sume that habits of beginning silent 
reading will be largely those of oral read- 
ing. This would serve to account for the 
r of .9з found by Gilmore between the 
two variables at the second grade. It 
would further seem to indicate that diffi- 
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Form A 


I 


Excessively long study periods are to be avoided on account of the diminishing returns which 
result from fatigue. But few students over-exert themselves in matters of study, although many 
commence their daily program in a tired condition. Their fatigue may frequently be attributed 
to preoccupation with details and to conflicts arising from excessive indulgence in social activities, 
or from long hours spent in occupational endeavors. Odd moments and periods between classes 
should be utilized, but a workable plan makes provision for periods long enough for concentrated 


and sustained effort. 
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o п 

Right belief and credulity;refer to habits of mind as well as to standards of evidence. “Thinking 
straight is essential to seeing straight. The evidence grows out of the attitude far more than the 
attitude results from the evidence." Some people are so entrenched in their own convictions that 
they resist the truth. It is easy for an anti-vivisectionist, for example, to discredit the values of 
animal dissection in the progress of medical research. Most people believe too much rather than too 
little. Credulity is rampant. Ours may be a scientific age, yet this era is the hey-day of the fortune 
tellers, 
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n [1 
We are today in the midst of those convulsions. Germany is divided and broken, slowly starving, 
a cancerous growth in the heart of Western Europe, a breeding place for the dark philosophy of 
Nietzche. There festers in her ruins the poison of fascism, the virus of communism. Either totali- 
tarianism flourishes in decadence and destruction. In the Netherlands, the wealth of the Indies no 
longer builds stately mansions; scores of thousands of Dutch want to emigrate from broken Europe. 
Belgium is scarred by old wounds and ancient animosities, and France—still politically bankrupt and 

ethically and intellectually confused—lies weak and torn and divided. 
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e IV 


The first attack on merchintilism developed in France in the eighteenth century. Men known as 
physiocrats began to argue that governments would do well to leave business alone. Deducing their 
ideas from the Newtonian faith in natural law, they believed that there were natural laws in 
economics, which were as immutable as the natural laws of astronomy. Laissez faire | (let things 
alone)—let nature run its course—and all will be well, these men proclaimed. The physiocrats were 


very theoretical in their beliefs, and living in an agricultural country, they considered agriculture 
to be the principle source of all wealth. 
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1. The process described in this passage is one of 
а. cell composition 
b. decay 
с. thickening 
d. function s 
e. development 
2. Mechanical tissue is formed by the thickening of the 
à. walls 
b. bundles 
c. meristematic cells 
d. strands 
e. internodes 
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in experimentation is оһу assa; 
r in 1 lated, ассо 
g: cording to this P ge by 


b. thorough training 
c. repetition 
d. analysis 
e. preliminary study 
The purpose of this writer is to 
a. warn 
р. describe 
с. persuade 
d. admonish 
c. entertain 
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One wishes to avoid exaggeration, but it seems safe to say 
frequently, when the spirit of comedy makes life endurable. 
Serious, we can bear with fortitude, but they are for the m 
come. But the lesser bafflements, the grotesque inconsistenci: 
the things that though individually they may be of no p 
sand in one's shoes, a sliver under the skin, a cind 
slate clean and refreshes the soul. 


that there are times, and not in- 
The tragedies of life, because they arc 
ost of us rare, and to some they never 
es, the irritating irrelevancies—these are 
ower to quell, in the aggregate are like 
er in the eye. It is comic laughter that wipes the 
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, Probably at no other time in tc world's history has the individual been so much occupied with 
himself as in the years through which we have recently been living. The mark of that self-absorption 
has been deep upon our literature. The state of mind which it represents has been especially acute 
among men and women of intellectual pretensions, but the malady has by no means been confined 
to them. If it had been there would have been nothing like the market which has existed during 
Tecent years for the books of self-help which have appeared in such profusion. 
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Some courses, because of either intrinsic difficulty or the length of the assigned readings, absorb 
so large a share of study time that the danger of slighting other subjects is great unless recreational 
activities are curtailed. A student lacking the skills necessary to satisfactory performance in a college 
course inevitably faces the need of expending a great amount of time in maintaining normal 
progres. Thus, inadequate training in mathematics prevents acceptable work in physics and 
chemistry, just as ignorance of the rudiments of grammar handicaps the writing of creditable 
compositions. By dint of hard work, fundamental skills can be acquired. 
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п 
failures forgotten. The-stock answer is that the stars 
hich is admittedly an argument to end all arguments. 
The richest field for the study of credulity lies in medical quackery whether it be a movie star who 
attributes her perfect teeth to some particular brand of toothpaste, or a distinguished foreign 
Physician who recommends ycast for a specific ailment in such a way as to allow the advertiser to 


seduce the public into believing that a cure for infections must be a medicine for all ills. 


o 
Most successes in prophecy are remembered, 
are never wrong, but that astrologers may be, w 
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The story tells how Enmerkar, employing a "modern" diplomatic approach, sent an NONE, 
to the Lord of Aratta and suggested surrender. The latter, the epic нен oi he ke 
protege of Inanna, the great Sumerian goddess of love and war, and thus diplomatical de E 
proposal. Enmerkar then unleashed 2n ingenious psychological маг of nerves" by sen jing ae 
to the Lord of Aratta and his people. He entwined the threats with favors such as SUPp um. Wie 
for the people. The basic characteristics of man's society have remained relatively unc hanged s 
three thousand years before the days of Christ, Julius Caesar, or the Roman Empire. 
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as both stimulated and transformed by the new 
the beginning of that ега Were those of the 
f the state and the subordination of the 


in economics as in politics, the move- 


Economic thought in the eighteenth century w 
Wealth. The prevailing economic doctrines at 
mercantilist school, which upheld the omnipotence O 


individual to it. Ever since the advent of the Tudors, i 1 101 
Ment was away from the small unit, the guild and the town, and toward intensified competition 


along national lines. The prosperity of a country was thought to depend upon governmental 
regulation of trade and industry. Wealth was thought of in terms of actual gold and silver. 
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In his search for facts and principles, it is the aim of the scientist-to proceed in a purposeful, 
orderly, and logical manner. He must first be adequately prepared to proceed. The modern chemist, 
physicist, or biologist must serve an arduous apprenticeship in his particular branch of science. So 
early is this specialization begun that too often no broad basis is laid and no really adequate founda- 
tion is afforded for a lifetime of effective citizenship or even of scientific research. Many a biologist, 
for example, has been balked of high attainment through inadequate training in chemistry or in 
physics. 
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Some readily observable evidences of growth are increase in the size and number of organs. A 
gardener thinking in terms of some particular crop, measures growth chiefly by the development of 
the particular part or parts of the plant yielding the products in which he is interested. A forester 
measures growth primarily in respect to the height and diameter of his trees. A student of botany 
may be concerned with the grosser evidences and measurements of growth; but for a broad grasp 
ОЁ the growth concept it is essential also to consider the variety of growth phenomena, the manner 
in which growth proceeds, the detailed mechanism of growth processes, the environmental factors. 
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Greatness in prose will always be on a lower level than greatness іп poetry. We have seen that 
what poetry can achieve by the magic of emotive language, imagery, rhythm, and the other devices 
of communication, prose must attain by the more pedestrian means of cumulative detail, and by 

^ exposition, example, illustration, and the other devices of rhetoric and logic. Prose can never hope 
to speak so intimately to the imagination of the reader, though in its more impassioned passages 
it may break over into the domain of poetry. Yet, in its secret of greatness, prose is ncver essentially 
different from poetry. 
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To what purpose, April, do you return again? 
Beauty is not enough 
You can no longer quiet me with the redness 
Of little leaves opening stickily. 
I know what I know. 
"The sun is hot on my neck as I observe 
The spikes of the crocus. 
The smell of the earth is good. 
It is apparent that there is no death. 
But what does that signify? 
Life in itself is nothing 
An empty cup, a flight of uncarpeted stairs. 
It is not enough that yearly, down this hill, 
April 
Comes like an idiot, babbling, and strewing flowers. 
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APPENDIX B ¢ 


ACKNOWLEDGEMENTS TO PUBLISHERS = 


The reading passages of the Freshman Oral 
Reading Test were adapted from the following 
sources with the permission of the publishers 
cited therewith. 

i. Passage I, Form A and Passage I, Form B, 
were adapted from Charles Bird's Effective study 
Habits published by D. Appleton Century Com- 
pany, New York. 

2. Passage II, Form A, and Passage II, Form 
B, were adapted from Wayland Vaughan's Gen- 
eral Psychology published by the Odyssey Press, 
New York. 

3. Passage III, Form A, and Passage III, Form 
B, were adapted from the New York Times 
for March 2, 1947. Passage VIII, Form A, was 
adapted from the New York Times for March 30, 
1947. These adaptations were made with the per- 
mission of the managing editor and with the 
permission of the editor of the New York Times 
Book Review. 
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4. Passage IV, Form A, and Passage IV, Form 
B, were adapted from A History of England and 
the British Empire by W. P. Hall, R. С. Albion, 
and J. B. Pope. Ginn and Company, New York, 
are the publishers. 

5. Passage V, Form A, and Passage V, Form 
B, were adapted from E. E. Stanford's Man and 
the Living World published by the Macmillan 
Company, New York. 

6. Passage VI, Form A, and Passage VI, Form 
B, were adapted from 4 Textbook of General 
Bolany by G. M. Smith, R. E. Evans, B. M- 
Duggar, С. L. Bryan, and C. E. Allen. The 
Macmillan Company, New York, are the pub- 
lishers. 

7. Passage VII, Form A, and Passage VII, and 
УШ, Form B, were adapted from Philo М. 
Buck's Literary Criticism, A Study of Values 
in Literature published by Harper and Brothers, 
New York. 
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CHAPTER I 


e 
ТНЕ PROBLEM AND AIMS OF THE STUDY 


THEORIES RESPECTING THE RELATIONSHIP 
BETWEEN Јов SATISFACTION, PERSONALITY, 
AND VOCATIONAL INTERESTS 


1 job, in our culture, has been 
variously described as a symbol. of 
status, a source of self-respect, a funda- 
mental basis for the formulation of 
values, and an important contributor to 
mental health. A modicum of satisfac- 
tion has been recognized as essential to 
the individual as one of the products of 
his work. Work satisfaction should be 
assessed in connection with interests and 
personality so that the contributions of 
each may receive proper emphasis. The 
relation of work satisfaction to the de- 
Bree of personality adjustment and voca- 
tional interest has been speculated upon. 
In counseling situations, work dissatis- 
faction has often been ascribed to person- 
ality maladjustment, lack of appropriate 
Vocational interests, or both. This point 
of view was demonstrated in a book de- 
voted to this topic, with its principal 
conclusion that: 

Much of vocational maladjustment is merely an 
aspect of the still bigger problem of emotional 
maladjustment. The reason why this fact has 
Not been adequately recognized by those inter- 
ested in yocational adjustment is simple; just 
because he does not know, the individual sel- 


dom assigns the true cause as the factor under- 


lying his vocational maladjustment. He is dis- 


Satisfied and simply because his work fails to 


Bive him the satisfaction he is seeking, he thinks 
his work is to blame (14, P- 21): 

‘The same authors asserted that nine 
times out of ten the individual, who is 
More or less constantly in a state of ex- 
Cessive emotional tension, is vocationally 
maladjusted (14). Another study hy- 
Pothesized that deep-seated personality 
Maladjustment might cause individuals 


to be vocationally dissatisfied, despite 
their possession of vocational interests 
similar to those successfully engaged in 
the same occupation (50). Kimball Young 
has asserted that the emotionally malad- 
justed worker's inability 

to adapt himself to the demands of the world 
reflect back into himself in terms of thwarted 
wishes, unhappiness, and feelings of failure—a 
discrepancy between his achievement апа his 
aspiration. And naturally such a person carries 
these attitudes or “emotional complexes” into 
new situations. Usually, the individual does not 
know why he is unhappy or dissatisfied or un- 
successful, and it is not uncommon for him 
to project blame for his maladjustment upon his 
job and the working situation (65, рр. 615-616). 


Shaffer has averred that only lately has 
recognition been given to the viewpoint 
that much vocational dissatisfaction is 
due to the existence of general personal 
maladjustment in individuals (51, рр. 
516-517). Quite recently, this topic has 
been discussed as follows: “The indi- 
vidual who is satisfied and adjusted with 
respect to his working situation is likely 
to be one who is well adjusted in his 
family, social and community life" (16, p. 
448). ; 
While not denying that personality 
maladjustment may cause job dissatis- 
faction, another view has been that either 
might cause the other and what would 
remedy one should remedy the other (25). 
Goodwin Watson’s results did not sup- 
ort the contention that poor morale on 
the job was largely due to personal mal- 
adjustment (61). Kornhauser’s opinion is 
that few workers were irremediably of 
low morale because of their emotional 


maladjustment, declaring that external 


variations are still influential (29). 
Very little has appeared in the litera- 
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ture relating work satisfaction to meas- 
ured interests, As Super has pointed out: 
In spite of the widespread acceptance of the 
hypothesis concerning the significance of voca- 
tional interests for personal and vocational ad- 
justment, surprisingly little research has been 
done in the prediction of satisfaction (54, р. 
364). 
Strong utilized vocational satisfaction as 
one of the criteria of validity of his Vo- 
cational Interest Blank, stating "Its 
validity should be measured in terms of 
satisfaction not while the person is pre- 
paring for the work but later on while 
he is engaged therein" (52, p. 384). 
Darley reported that the “fulfillment of 
the interest type (on the Strong Blank) 
represented a personally satisfying adjust- 
ment for the individual”, and that this 
type of satisfaction if realized on the job 
perpetuated. itself and was an important 
phase of adult adjustment (9, p. 51). 
The responses of the individual in 
terms of liking, disliking or indifference 
to the many items on an interest inven- 
tory are expressions of his drives and 
goals and give evidence of how he con- 
ceives of himself in relation to these 
(7, Бо). Vocational interests have been 
conceived as distinct but integral parts 
of the basic personality structure, suscep- 
tible to the same influences (4). 


THE PROBLEM: AIMS OF THE STUDY 


This study investigated the vocational 
interests, Rorschach responses, and job 
satisfaction of a group of workers em- 
ployed by the Federal government and 
engaged in routine clerical work. It at= 
tempted to discover whether those who 
professed greater satisfaction with their 
work could be distinguished on the 
clerical scale of the Strong Vocational] 
Interest Blank from those who indicated 
less satisfaction. At the same time, the 


question was raised whether the m 
ore 
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satisfied clerks could be distinguished 
from the less satisfied by the number of 
Rorschach signs of adjustment as evalu- 
ated by two objective scoring techniques, 
the Davidson List of Adjustment. Signs 
and the Munroe Inspection "Technique. 
Information was also sought on the re- 
lationship of the clerical scores on the 
Strong Vocational Interest Blank to the 
number of Rorschach signs of adjust- 
ment and maladjustment as appraised by 
the Davidson List and the Munroc In- 
spection Technique. The study included 
ап attempt to ascertain which, if any, of 
the specific Rorschach responses were 
related to work satisfaction. Finally, 
there was an exploration of the relation- 
ship of Rorschach responses of routine 
clerks to their scores on the Strong Vo- 
cational Interest Blank, clerical scale. 
Succinctly stated, this study undertook 
to discover the interrel | 


ationships among 
such factors 


as job satisfaction, vocational 
interests, and Rorschach signs of mal- 
adjustment, Those Ror: 


а schach responses 
if any, 


that were significantly related to 
job satisfaction and vocational interests, 
were segregated, 

The follow 


ing hypotheses were set UP 
to be tested: 


1. Job. satisfaction 
bears a significant pos 
to scores on the cler 
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clerks who possess in 
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of successful office workers ig not related 
to the number of Rorschach signs of mal- 
adjustment. я 

4. Certain. Rorschach responses, evi- 
denced in the records of routine clerical 


workers, are significantly associated with 
their possession of vocational interests 
similar to those of successful office work- 


ers. 
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ture relating work satisfaction to meas- 
ured interests. As Super has pointed out: 
In spite of the widespread acceptance of the 
hypothesis concerning the significance of voca- 
tional interests for personal and vocational ad- 
justment, surprisingly little research has been 
done in the prediction of satisfaction (54. р. 
364). 
Strong utilized vocational satisfaction as 
one of the criteria of validity of his Vo- 
cational Interest Blank, stating “Its 
validity should be measured in terms of 
satisfaction not while the person is pre- 
paring for the work but later on while 
he is engaged therein" 52, p. 384). 
Darley reported that the “fulfillment of 
the interest type (on the Strong Blank) 
represented a personally satisfying adjust- 
ment for the individual”, and that this 
type of satisfaction if realized on the job 
perpetuated itself and was an important 
phase of adult adjustment (о, P: 51). 
The responses of the individual in 
terms of liking, disliking or indifference 
to the many items on an interest inven- 
tory are expressions of his drives and 
goals and give evidence of how he con- 
ceives of himself in relation to these 
(7, 50). Vocational interests have been 
conceived as distinct but integral parts 
of the basic personality structure, suscep- 
tible to the same influences (4). 


THE PROBLEM: AIMS oF THE STupy 


This study investigated the vocational 
interests, Rorschach responses, and job 
satisfaction of a group of workers em- 
ployed by the Federal government and 
engaged in routine clerical. work. Tt at- 
tempted to discover whether those who 
professed greater satisfaction with their 
work could be distinguished on the 
clerical scale of the Strong Vocational 
Interest Blank from those who indicated 
less satisfaction. At the same time, the 
question was raised whether the more 


satisfied clerks could be distinguished 
from the less satisfied by the number of 
Rorschach signs of adjustment as evalu- 
ated by two objective scoring techniques, 
the Davidson List of Adjustment Signs 
and the Munroe Inspection Technique. 
Information was also sought on the re- 
lationship of the clerical scores on the 
Strong Vocational Interest Blank to the 
number of Rorschach signs of adjust- 
ment and maladjustment as appraised by 
the Davidson List and the Munroe In- 
spection Technique. The study included 
an attempt to ascertain which, if any, of 
the specific Rorschach responses were 
related to work satisfaction. Finally, 
there was an exploration of the relation- 
ship of Rorschach responses of routine 
clerks to their scores on the Strong Vo- 
cational Interest Blank, clerical scalc. 

Succinctly stated, this study undertook 
to discover the interrelationships among 
such factors as job satisfaction, vocational 
interests, and Rorschach signs of mal- 
adjustment. Those Rorschach responses, 
if any, that were significantly related to 
job satisfaction and vocational interests, 
were segregated. 

The following hypotheses were set up 
to be tested: 


1. Job satisfaction in clerical work 
bears a significant positive relationship 
to scores on the clerical scale of the 
Strong Vocational Interest Blank. Those 
clerks who Possess interests similar to 
those of successful office workers tend to 
be more satisfied. 

2. Job satisfaction in clerical work is 
not related to Rorschach signs of mal- 
adjustment. The degree of job satisfac- 
tion of clerical workers is independent of 
the number of Rorschach signs of mal- 
adjustment. 

3. The work dissatisfaction of routine 
clerical workers Who possess the interests 
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of successful office workers ig not related 
to the number of Rorschach signs of mal- 
adjustment. З 

4. Certain Rorschach responses, evi- 
denced in the records of routine clerical 
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workers, are significantly associated with 
their possession of vocational interests 
similar to those of successful office work- 


ers. 


CHAPTER II 


ج 


SURVEY OF THE LITERATURE 


PREVIOUS INVESTIGATIONS RELATED TO 
Tuts STUDY 


HE literature relative to job satisfac- 
Zw vocational interests, personality 
and Rorschach responses, when each of 
these is considered independently, is so 
extensive that it would be impractical to 
present a complete survey. Readers seek- 
ing fuller discussions on job satisfaction 
are referred to Hoppock and collabora- 
tors (18, 25); on the Strong Vocational 
Interest Blank, to Strong (52) and Carter 
(8); on the Rorschach Test, to Klopfer 
(28), Hertz (20, 23), and Beck (1). 


Job satisfaction has been discovered to be 
associated with: occupational level (53), a sense 
of fulfillment in constructive work (13), and the 
fulfillment of the social demands of the in- 
dividual (48). The evidence has been conflicting 
on the relationship of age to job satisfaction 
(30). No one single factor or group of factors 
operated alone in creating satisfaction, but it 
has been recognized that workers reacted to 
their understanding of the whole situation (29). 

Darley apparently confirmed his hypothesis 
that interest types are related to the develop- 
ment of personality in his study of 1000 sub- 
jects (9). The significance of some pertinent in- 
vestigations of the relationship between voca- 
tional interests and personality has been under- 
mined: (а) by the use of subjects who were 
not working, and (b) by the inclusion of oc- 
cupations, under one interest type, that are 
probably dissimilar in many traits, eg. artist 
and physicist, and correlating the scores on the 
interest type scale with personality measures. 

Dodge concluded that the more successful 
clerical workers in their self-evaluations seem 
to be less ready to assume responsibility, less at 
ease in social contacts, and more dependent 
upon others than the less successful workers 
(11, 12), within routine work situations. There 
was a marked tendency for psychoneurotic ex- 
soldiers to drift into clerical work after failing 
to adjust in other types of employment (44). 

Occupational dissatisfaction has been found to 
be associated with a lack of primary interest pat- 
tern in the current occupation as measured by 
Strong's Blank (50). Eighty-two percent of 241 
college graduates who entered Occupations for 


which high scores were earned on the Strong 
Vocational Interest Blank were satisfied (34). 


PREVIOUS INVESTIGATIONS RELATED TO THE ‘TESTS 
USED iN Tuis STUDY 


Rorschach Test 

While the Rorschach method has become onc 
of the most popular instruments for assessing 
the whole personality, much doubt has been cast 


upon its validity and reliability as a personality - 


test, Experimental evidence of the validity of 
the Rorschach method has been gained by giv- 
ing this test under experimentally altered. con- 
ditions and demonstrating its extreme sensitivity 
to changing conditions, attitudes, or emotional 
states (32). Others have compared the Rorschach 
results with outside criteria and have demon- 
strated marked agreement (6, 40). The case for 
the validity of the Rorschach Group Test rests 
upon the evidence for the individual test (23; 
24, 33). The reliability of the Rorschach Test 
has been studied using both testretest and 
split-half methods, with results permitting CON- 
flcting interpretations (15, 19, 20, 27, 55, 59) 

The psychological significance of the Ror- 
schach movement (M, FM) and the color (FC. 
CF, C) responses has not been subjected to carc- 
ful validation, and these responses or categorics 
have unfortunately been accepted and used DY 
many Rorschach workers despite the lack of 
demonstrated validity (22). There has been es- 
sential agreement in the meanings assigned (0 
the Rorschach human action (M) response (22, 
28, 49). The results of studies that corrclated 
the Rorschach Test responses with personality 
inventories are in disagreement with the con- 
cept of the human action (M) response as an 
indicator of maturity (21, 60). The validity of 
the human action (M) response has been pur- 
portedly demonstrated by hypnosis (gi), while 
some reliability coefficients for this Rorschach 
response have been ascertained to be .62 (59), 
“75 (20), апа .52 (27). 

Again, there has been essential agreement on 
the psychological significance of the Rorschach 
animal action (FM) response (g8), and the 
Rorschach color (FC, CF, C) responses (6, 28, 
49). There has been some evidence that emo- 
tional stability and adaptability varied directly 
with the excess of definite form-color (FC) res- 
ponses over indefinite (CF) and exclusive color 
(C) responses (21). There has been no confirma- 
tion that more human action (M) than animal 
action (FM) responses imply maturity (21, 60). 
Reliability of color responses in the Rorschach 


= 
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Test has been ascertained as .3@ (59), -81 (20), 
69 (15), and .68 No apparent reason scemed 
to explain. these discrepancies. 


The Munroe Inspection. Technique 


The Munroe Inspection Technique (до) is an 
objective method of evaluating Rorschach re- 
Sponses that yields numerical ratings indica- 
tive of the degree to which these responses de- 
Viate from what has been empirically determined 
to be adequate. By means of the Munroe In- 
Spection scores, Munroe was able to discriminate 
With a high degree of accuracy between stu- 
dents who were poorly adjusted and those who 
Were adequately adjusted. (40). Roe's investiga- 
lion demonstrated that while the usual methods 
Of Rorschach Test analysis were unsuccessful in 
picking out the excessive drinkers, the applica- 
tion of the Munroe Inspection Technique 
brought out clear differences (45). For 250 col- 
lege and high school students, there was good 
Correspondence between the Munroe Inspection 
Score and the degree of maladjustment as deter- 
mined by external indices (38). Fifteen “prob- 
lem” students selected independently of the Ror- 
schach Test had an average Munroe Inspection 
Score of 7.9 with a range of 5-10. A group of 
Students selected as unusually well adjusted, 
had an average score of 2.6 on the Munroe In- 
Spection List with a range of 1-4 (86). 


The Davidson List of Signs 

List of Signs was established 
о). Margu- 
as able 
cessful 


The Davidson 
Оп a group of 102 superior children @ 
lies, using the Davidson List of Signs, w 
to distinguish between scholastically success 
and scholastically unsuccessful boys of equiva- 


lent intelligence. (35). 


The Strong Vocational Interest. Blank 


The Strong Vocational Interest Blank has 


been called the "outstanding device of its kind 
(8 p. 14), For the appraisal of a vocational 
interest test, Strong asserted that he could think 
9f no better criterion than that of satisfaction en- 
during over a period of time (52) . Carter further- 


more stated that the real purpose of the voca- 
tional interest inventory is to predict vocational 
satisfaction and to indicate vocational preference 
(8). Strong has stated that those who are sat- 
isfactorily adjusted in an occupation may be 
differentiated by the test from those who are 
not satisfactorily adjusted and from those not 
engaged in the occupation. Due to the inade- 
quacy of measures of work satisfaction, Strong 
has assumed that continuance in an occupation 
is a measure of satisfactory adjustment, and if a 
person progressed on his job and was reasonably 
satisfied he was well adjusted vocationally (52). 
A test-retest correlation of .59 following an in- 
terval of three years was found for the Strong 
office worker key (58). Strong's reliability co- 
efficient after a five-year interval for the office 
worker scale was .74 while the odd-even tech- 
nique yielded a coefficient of .88 (52). 


The Job Satisfaction Blank 

The blank (Appendix A) used in this inves- 
tigation was almost identical to that devised by 
Hoppock for his study of job satisfaction (25). 
This blank required the subject to evaluate his 
fecings about the job as a whole. Having the 
subject do so placed the responsibility upon him 
of weighing all the job factors. Morale or satis- 
faction with work was exclusively the sum of at- 
titudes toward work situations and it was not 
known which of several aspects of a job de- 
noted morale (63). A forty-four item question- 
naire, sampling almost every area of the sub- 
ject’s work environment, produced substantially 
similar results in terms of job satisfaction as 
did one item requesting the subject to evaluate 
his feelings of satisfaction with the job (17). 

The split-half reliability of the job satisfac- 
tion blank was .93 after correction by the Spear- 
man-Brown formula from a raw correlation of 
87 (25). Berdie ascertained that the reliability 
coefficient, also split-half, was .97 (3). The writer 
carried out a study of бо subjects who answered 
the job satisfaction blank at intervals of three 
weeks. A test-retest reliability coefficient of .go 


was obtained. 


CHAPTER III e 


SELECTION OF SUBJECTS AND TESTS AND 
METHODS OF GATHERING DATA 


SELECTION OF SUBJECTS 


HE subjects for the study were 
| with a view to keeping to a 
minimum the variations in factors, other 
than the experimental ones, which might 
affect job satisfaction. The choices which 
were made, and the reasons for them, 
were as follows: 


1. A group of men was used, because earning 
à living more clearly represents a permanent 
activity for them. 

2. All subjects came from the same occupa- 
tional group so as to eliminate the effect upon 
job satisfaction of differences between occupa- 
tions. 

3. АП subjects enjoyed, presumably, the same 
general physical working conditions so that; dis- 
similarities in conditions that might influence 
job satisfaction were held to a minimum. 

4. All subjects earned annual salaries that did 
not vary too greatly so that the effect of salary 
differences upon job satisfaction was minimized. 

5. The subjects, for practical considerations, 
were engaged in an occupation for which a scale 
on the Strong Vocational Interest Blank had 
been established. 

One hundred male routine clerical workers 
employed in one establishment of the United 
States Government were selected. 

No selective factor was apparently operative 
in the choice of these subjects. Virtually all the 
male office workers within one administrative 
department were asked by the department head 
to report for the testing required for this study. 
There was no coercion. All those who wished 
participated. Many factors operated to bring 
about 100 percent cooperation. First and most 
important, neither the subject nor his test re- 
sults could be identified by the examiner. The 
subjects drew their test blanks in random order 
from the stack. Only numbers appeared on the 
papers. The appeal of one of the tests was great, 
due to the publicity given by many popular 
magazines and a recent moving picture. Others 
reported because it provided a welcome adven- 
ture during working hours away from their 
work duties. And finally, the spectacle of many 
of their work intimates eagerly volunteering im- 
pelled many others to present themselves. Suffi- 
cient time off from work was permitted so that 
there was no need for haste in taking the ex- 


amination. None of those who had been em- 
ployed for less than six months were included. 

The subjects averaged 29.7 years of age and 
had completed 12.1 years of schooling, as com- 
pared with Strong's criterion group for office 
workers which averaged 33.2 years of age and 
had attained a mean of 11.5 years of education 
(52). Only one routine clerk had completed no 
more than elementary school while four sub- 
jects had finished four years of college. Two 
clerks had one year of high school, cight had 
two years, thirteen had three years, while fifty- 
one completed four years of high school. There 
were eleven with one year, seven with two years, 
and three with three years of college work. 

Ninety-two percent of the routine clerks were 
born in this country with 74 percent giving 
New York City as their birthplace. Only cight 
other states were represented. Of the eight pet 
cent who were of foreign birth, two percent cach 
were born in Russia, Poland, Germany, and the 
British West Indies. All were citizens of the 
United States. Thirty-three percent of the sub- 
jects' fathers were born in this country. Italy 
as the fathers’ birthplace accounted for 36 pet 
cent with Russia having 15 percent. The British 
West Indies was represented by five percent and 
Poland had four percent. Norway, Hungary and 
Germany each showed two percent while Aus- 
tria had one percent. 

Utilizing the occupational scale devised by 
Beckman (2), the fathers’ occupations were classi" 
fied as follows: Grade I—Unskilled Manual OC 
cupations, Grade II—Semi-Skilled Occupations, 
Grade III —Skilled Manual and Skilled White Col- 
lar Occupations, Grade IV—Sub-Professional, 
Business and Minor Supervisory Occupatiot 
Grade V— Professional, Managerial, and Executive 
Occupations. This scale, according to the origina- 
tor who cited evidence, indicates the rank and the 
socio-economic prestige attached to a given ОС- 
cupation based on intelligence, skill, education, 
and training required for its pursuit. The OC 
cupations of the fathers of these clerks were 
distributed among the following grades in this 
proportion: Five percent were in Grade I, seven 
percent were in Grade II, 78 percent in Grade 
ПІ, six percent in Grade IV, and four percent 
in Grade V. Three of the four in Grade V were 
musicians. The great majority of the subjects 
fathers had been or were engaged in skilled. 
manual or white collar occupations. The afore- 
mentioned facts should provide a framework for 
understanding the socio-economic background of 
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these subjects and for evaluatingyresults of this 
study. 

The subjects were classified under Federal 
Civil Service principally as clerks, 10 as CAF-2, 
and 87 as CAF-g clerks, with just three in the 
CAF-4 category. The salary ranges were from 
e xd $2850 per annum with gs percent of 

he clerks earning between $2400 and $2700 
yearly. Each subject was entitled to salary in- 
crements for each eighteen months of satisfac- 
tory service. Their duties, as specified by the 
Civil Service announcement, were clerical. A 
more detailed description of their duties might 
afford a more appropriate perspective with 
Which to view the results of this study. Within 
One sub-group, the clerks were charged with 
dealing with forms embodying requests for aid, 
nd these forms, submitting these written 
un es en the professional staff for attention, 

issuing the necessary written authorizations. 
Another sub-group of clerks maintained per- 
sonnel records and requisitioned supplies. Ап 
additional sub-group devoted themselves to the 
processing of requests for payment of services 
furnished by outside agencies. The voucher sub- 
group maintained the master control cards, gov- 
erning the reimbursement limitations for use of 
community resources. The eligibility unit, in 
Which group there were many subjects, deter- 
а the eligibility of applicants for assistance 
uon office records, and took the necessary ac- 
ашпа to these applications pertaining to 
e Several were engaged in stenography, 
ЖБ] ing for the sub-group administrators. Other 
den ee dealt with the routine collection of 
in p and statistics. There was great flexibility 
task he assignment of these subjects to various 
asks. As the deputy administrator directly in 
charge of personnel allocation stated, "If а 
Man does not care for his work or finds it 
boring, we make adjustments. We transfer him 
to other types of work. We have never told a 
Sn this is your work—you have to do it i 
fouti The kind of work these clerks did was 

Sutine, but vital to the functioning of the or- 
tnetzation. They had supervisors with whom 
fied CHER. if the matter required any judg- 
Typin or decision not covered by procaine 
steno Е filing, sorting, checking through eap 
lated ene maintaining records, апа other r 

Man uties were required. dio d 
to ee as the routine clerks, fepe у, ques 
Yapher, cir fellow workers. One clerk, аа 
People stated that he found the P =m i 
agreed very congenial to deal with. Other бна 
sional with him and indicated that ve ae 
Stated оез treated them as equas, E 

В еге was a pretty wholesome relations р 
the professional personnel. A few clerks 
n ee about the pettiness of the supervisors 

€ unfairness of the efficiency ratings. Some 


&Xclai 


echoed one clerk’s declaration that “if I want 
to stick around and get along, I'd better keep 
quiet.” A small number indicated that personal 
worth was not appreciated. A few felt that they 
were underpaid for the jobs they performed. 
One clerk bitterly stated that the other clerks 
"don't have a mind of their own. They are will- 
ing to be led around." Despite a few such 
adverse indications, many objective signs of 
favorable work conditions were in evidence, such 
as an informal atmosphere, low turnover, and 
few disciplinary actions. Ample opportunity ex- 
isted for transfer to more congenial work duties 
and surroundings, if desired. On the whole, 
it appeared that the clerks performed their rou- 
tine duties in rather congenial working condi- 


tions. 
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The Rorschach Test 

The Rorschach Test, developed by Herman 
Rorschach (49), consists of ten standard inkblots, 
five gray and black and five colored cards. This 
test has become increasingly popular as an aid 
to differential diagnosis in clinical situations. It 
hasebeen used in such fields as anthropology, 
social case work, sociology, penology, educational 
and vocational guidance, and personnel selec- 
tion. It has been assumed by many that the 
mental procedure which the individual has 
adopted and the emotional reactions which he 
has displayed in the Rorschach situation have 
operated similarly when he met other life situa- 
tions. In his Rorschach Test reactions he has 
been thought of as giving a small scale sampling 
of his characteristic behavior (22). Results have 
obtained which, within certain limits, 
be subjected to statistical study and 
analyzed qualitatively to portray the presumed 
dynamics of personality or behavior One 
writer has declared that the Rorschach method 
has resulted in à configurational picture which 
has revealed the interplay between various 
major intellectual and emotional factors in the 
personality of the subject (28). It has been as- 
serted by some that the personality of the in- 
dividual, revealed in his projections or reac- 
tions, can be reconstructed. by the experienced 
Rorschach examiner who can gain knowledge 
of mental procedures, motives, interests, de- 
sires, fantasy life, emotionality, ability, and the 
like (1, 22, 28); others, on the other hand, have 
he need for caution in interpret- 
ing Rorschach responses, because of the sub- 
jectivity of most of the evidence on which in- 
terpretations have been based (22, 66). For this 
reason, this study has utilized the Rorschach 
only аз an instrument yielding certain objectively 
measurable indices, which might be related to 
other variables such as inventoried interest and 


expressed job satisfaction. 
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The Munroe Inspection Technique. 


Аз an important outgrowth of her study of 
the validity of the Rorschach as a "clinical" 
method, Munroe (40) has developed a special 
and relatively objective procedure for dealing 
with Rorschach protocols, which she has termed 
the Inspection Technique. In her preface, Mun- 
roe has stated: 


For the single purpose of providing a measure 
of what—for want of a better word—we have 
called adjustment it seems probable that the 
Rorschach can be handled like an ordinary psy- 
chological test with the customary attention to 
set scoring standards and with at least very im- 
portant reduction in the role played by the 
judgement of the examiner. Moreover, the meas- 
ure of adjustment can be expressed quantita-, 
tively either as a rating or (probably) as а score 
obtained by adding up the entries made on a 
prepared check list (до, p. 6). 


The quantitative measure of Rorschach ad- 
justment in this study was obtained by evaluat- 
ing the Rorschach records of the 100 routine 
clerks on the basis of the revised Munroe In- 
sSpection Technique (35). This check list con- 
sisted of the major categories commonly wsed 
in Rorschach scoring and for Rorschach inter- 
pretation. Each category had fixed limits for 
What have been clinically considered as "nor- 
mal" Rorschach responses by Rorschach examin- 
ers. Specific numbers of entries were made for 
each category according to the extent of devia- 
tion from these "normal" limits. 

Rorschach areas of "adequate" functioning 
included those responses that were within the 
empirically established “normal” range of 
Rorschach responses as shown in each of the 
categories on the Munroe Inspection List. Clin- 
ically, these responses have been found to carry 
no indications of personality maladjustment. 
The subject's performance within specific cate- 
gories or areas was examined to see if it was 
within the predetermined “normal” range. If 
within “normal” limits, no entry was made for 
that specific Munroe category. This meant that 
the performance of the subject with regard to 
that Munroe category was “adequate.” “Ade- 
quate” will have this meaning throughout the 
study when applied to Rorschach scores, 

Rorschach areas of maladjustment embraced 
those responses that were evaluated as falling 
outside the empirically established “normal” 
range of Rorschach responses for each of the 
separate Munroe Inspection List categories. The 
degree of deviation of these responses from the 
predetermined "normal" Tange was handled by 
making one, two, or three entries for the par- 

. ticular Munroe Inspection List category. The 
sum of the entries (or the Scores) on the Mun- 
roe list for each subject will be referred to 
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throughout the study as the subject's Rorschach 
signs of mala&justment. Each of the categories 
in the Munroe Inspection Technique was eval- 
uated in relation to the Rorschach performance 
of the subject as a whole. A low score on the 
Munroe Inspection List showed that the sub- 
ject had given few responses clinically considered 
to be indicative of maladjustment. A high score 
on the Munroe Inspection List signified that 
there were many areas in which he displayed 
many deviations from the clinically established 
“normal” range. 

The instructions set forth for using the Mun- 
roc Inspection Technique were followed in this 
investigation. Although Munroe emphasized the 
importance of using clinical judgment in the 
event that a percentage approached significance 
in terms of plus, minus, or check scores, the 
writer decided to adhere strictly to the arith- 
metical limits for making the number of entries 
set forth in the directions for the application 
of the check list. The value of the writer's pro- 
cedure in terms of making the Munroe Inspec- 
tion List a more objective measure of Rorschach 
adjustment was considerable. Also following the 
Munroe Inspection Technique instructions, an- 
other qualified Rorschach examiner and clini- 
cal psychologist (Miss C. Mildred Ceres) in- 
dependently evaluated the protocols, The agrec- 
ment of the Munroc Inspection scores that she 
obtained with those obtained by the writer was 
considerable, the correlation coefficient being 
:94- It was believed that the merit of this result 
Offset the few advantages that using clinical 
judgment might have brought. 


The Davidson List of Signs. 


A high score on this list signified having many 
Rorschach signs of adjustment while a low score 
meant the lack of possession of Rorschach signs 
of adjustment. The Davidson List of Signs (10) 
was introduced into this investigation to ascer- 
tain the extent to which it yielded results simi- 
lar to those obtained by the Munroe Inspection 
Technique. It was not as extensive as the Mun- 
toe Technique nor did it attempt to distinguish 
as finely between the differences in quality and 
quantity of Rorschach responses as did the 
Munroe. Finally, the Davidson List operated on 
the principle of the Presence or absence of 
Rorschach signs of adjustment while the Munroe 
Technique assigned specific values to the de- 
gree of presence or absence of specific Rorschach 
factors of maladjustment. 


Procedure Followed for Administering the Ror- 
schach Test. 


The procedure followed for administering the 
Rorschach Test was as follows: Each subject 
Was given a set of Rorschach cards arranged in 
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order, face down, and the instrustions suggested 
by Klopfer (28) were followed, substituting the 
words "write out" for “tefl me." All answers 
Were written in a special booklet with each sub- 
ject taking as much time as necessary with each 
of the cards, turning each card if he so desired 
and was so inclined. After the subject had re- 
sponded to all the cards, ап individual inquiry 
was conducted with each subject separately so 
that most of the essential features of the stand- 
ard individual Rorschach procedure were re- 
tained. The subjects were tested in groups of 
four or five and adequate supervision was pro- 
vided by the presence of at least two examiners. 


The Strong Vocational Interest Blank 


The Strong Vocational Interest Blank was 
utilized to measure the similarity of the voca- 
tional interests of the 100 routine office clerks 
to those held by successful office workers. The 
standard procedure as prescribed by Strong (52) 
was followed. This blank consists of 4oo items 
Which included names of occupations, school 
Subjects, amusements, activities, peculiarities of 
people, order of preference of activities, types 
of famous persons most preferred, choice be- 
tween two paired activities, and self ratings of 
present abilities and characteristics. The items 
were marked according to one's likes, dislikes, 
indifferences, choices, and preferences. The oc- 
Cupational scale for the general office worker, 
as for the other occupations, was obtained by 
contrasting the responses of the criterion group 
to the доо items of the blank with those of 
the men-in-general group, and assigning rela- 
live weights to those items that displayed sub- 
Stantial differences. The raw score, obtained by 
the subject, is the sum of weights he earned 
оп the office worker scale of the Strong Blank, 
Which sum might be positive or negative. The 
raw score was converted into à standard score 
Which indicated just where the subject placed 
With respect to the distribution of scores of the 
criterion group on that scale. The score earned 
9n the general office worker scale was а measure 
of how similar the subjects interests were to 
those of the average successful office workers. A 
high score signified that the subject possessed 
interests similar to the successful office worker. 
A high score was said to be associated with long 
continued employment and with occupational 


adjustment was said to typify 
s and a low score 
dicated that 


the reverse conditions (52). Strong indicat 

there was surprising permanence of interests 
found among adults when vocational e t 
Was defined not as a single selection but ° к 
Sum tota] of many interests that bore in any 
Way upon an occupational career (52)- S 
has stated, “men engaged in occupations so far 


studied have a characteristic pattern of likes 
and dislikes which differentiates them from men 
in other occupations" (52, p. 47). 


The Job Satisfaction Blank 


The job satisfaction blank (Appendix A) was 
modeled after and differed very little from that 
devised by Hoppock (25). This blank consisted 
of four questions, each having seven possible 
alternatives, These questions required that the 
subject evaluate himself in connection with how 
well he liked his job, how he compared with 
other people in his liking for his job, how he 
felt about changing his job, and how much of 
the time he felt satisfied with his job. Job satis- 
faction had been defined as any combination of 
circumstances, psychological, physiological, and 
environmental that made a person state truly, 
“I am satisfied with my job" (25, р. 47). By hav- 
ing the subject explore and evaluate his feelings 
in response to the four questions on the afore- 
mentioned blank, it was believed that a valid 
measure of his job satisfaction would be obtained. 
Rationalization and self-deception may have ne- 
gated to some extent the subject's own estimate of 
hi? job satisfaction. Nevertheless, what the sub- 
ject thought and felt about his job must neces- 
sarily be real despite the origin of such feelings. 
It was these subjective feelings that this study 
endeavored to ascertain so that they may be re- 
lated to the other measures of the individual's 
functioning—the Strong Vocational Interest 
Blank, and the Rorschach Test. As Horst has 
declared, success in an activity may be considered 
in terms of the satisfaction derived from participa- 
tion in the activity by the person engaged therein 
(26). In the majority of prediction studies, how- 
ever, measures of success have been defined largely 
in terms of achievement or conformity to norms 
set by others, rather than in terms of the effect 
of the activities on the person engaging in them 
(26). Situational factors, Horst added, must be 
understood in the way they were defined by the 
individual himself (26). A true estimate of job 
satisfaction must come from the subject, not- 
withstanding the degree of his indulgence in 
rationalization and falsification. 

The score on the job satisfaction blank was 
the sum of the separate weights for each of the 
four questions. Each question was given a weight 
equal to the number of the alternative that was 
marked by the subject. Hoppock used this method 
for arriving at job satisfaction scores after dis- 
covering that a more complicated and ostensibly 
more accurate method of assigning weights gave 
almost identical results, the correlation between 
the two methods equaling .98 (25). Scores could 
range from a theoretical low of 4 to a theoretical 


maximum of 28. 


CHAPTER IV @ 


RESULTS Ë 


Јов SATISFACTION MEASUREMENT 


HE range of scores of routine clerks 
"Lus the job satisfaction blank was 
from 5 to 26, signifying that there were 
subjects who were intensely satisfied with 
their jobs as well as greatly dissatisfied. 
The mean of the group was 17.49 points 
and the standard deviation was 3.80. 
Table I gives the distribution of the job 
satisfaction scores. The group median 
fell at 18, while the mode was 16, the 
latter score having the top frequency of 
17. After arbitrarily fixing 18 as the low- 
est score indicative of satisfaction, it was 


TABLE I 
DISTRIBUTION OF JOB SATISFACTION SCORES 

Score Frequency 
7 to ro 6 
II tO 13 8 
14 to 15 то 
16 to 17 23 
18 to 20 31 

t 21 to 23 18 
24 to 26 4 


N= 100 Clerks 
Mean=17.5 
Standard deviation = 3.8 


noted that 53% of the routine clerks 
achieved that score or higher. The per- 
centage of the indifferent, those earning 
scores of 14 to 17, was 33, whereas the 
dissatisfied, those with scores of 13 and be- 
low, constituted 14 percent of the group. 


A certain minimum of intelligence might be 
prerequisite for satisfaction in any occupation. 
Uhrbrock has shown that there was a small in- 
significant negative correlation (—.14) between 
job satisfaction scores and Otis intelligence 
scores of 99 clerks (57). Probably, once the mini- 
mum intelligence level for an occupation was 
surpassed, additional increments of intelligence 
had little effect. The number of years of educa- 
tion of the clerks under current investigation 
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was correlated with their job satisfaction scores 
and the resulting correlation was —.12, agree- 
ing substantially with Uhrbrock (57) No re- 
lationship, hence, for routine clerical workers 
seemed to exist between job satisfaction and 
education. 


STRONG VOCATIONAL INTEREST SCORES 


The scores on the office worker scale of 
the Strong Blank ranged from ээ to 64. 
(The standard scores were utilized in- 
stead of raw scores due to the occurrence 
of many negative scores.) The mean score 
fell at 44.7 with a standard deviation 
of 9.7 (Table II). This was compared 
with the mean score of 50 and standard 
deviation of 10 of the office workers used 
by Strong as his sample. A difference 
significant at the 1 percent level existed 
between the means of the two groups. 
When the distributions of both groups 
according to letter ratings were con- 
tasted, a difference significant at the 1 
percent level was revealed (Table IIT). 
Both means and the letter grade distri- 
butions of the two groups were signifi- 
cantly different. Strong's sample of 326 
workers, drawn from all parts of the 


TABLE II 


DISTRIBUTION OF STRONG VOCATIONAL 
INTEREST CLERICAL SCALE SCORES 
AND RATINGS 


Standard Score Rating Frequency 
22 to 24 C 2 
25 to 29 C+ 7 
30 to 34 B- 8 
35 to 39 B 14 
40 to 44 B+ 11 
45 to 49 A 10 
501054 А 27 
55 to 59 A 7 
60 to 64 A 5 


М = тоо Clerks 
. Mean Ѕсоге= 44.7 
Standard deviation 29.7 
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e TABLE Ш 


CLERICAL INTEREST RATINGS OF 100 RourINE CLERKS AND THE 
CRITERION GROUP ON THE STRONG BLANK 


Letter Ratings 


Group 


А B+ B B—,C+,C 
Routine Clerks 58 1.0 14.0 17.0 
Criterion Group 7o 12.3 9.8 7.8 


Chi-square — 14.84 


3 degrees of freedom 


Significant at the 1% level 


United States, was composed of 214 office 
clerks, bookkeepers, stenographers, 9? 
office managers, and 20 credit managers. 
Age and educational level of the two 
groups were about the same or equal. 
More than one third of Strong's group 
Were subjects charged with the responsi- 
bility of making decisions, using initia- 
tive, or carrying on administrative duties. 


ReELATIONsHIP BETWEEN JOB SATISFAC- 
TION Sconrs AND VOCATIONAL INTEREST 
SCORES 


The correlation coefficient between job 
satisfaction scores and the scores on the 
clerical scale of the Strong Vocational 
Interest Blank was .21. This correlation 
Was not significant at the one percent 
level but was significant at the five per- 
cent level. For this particular group of 


clerical workers, there was only а slight 
ree of satisfac- 


relationship between deg 

tion with a routine clerical job and the 
score on the clerical scale of the Strong 
Blank. Vocational interests similar to 
those of office workers could be а factor 
contributing to satisfaction with routine 
clerical work but to a slight degree only. 
The partial correlation between job satis- 
faction scores and Strong clerical inter- 
est scores with the Munroe Inspection 
Technique total adjustment scores held 
Constant, was „эз. This latter coefficient 
Was not significant even at the five per 


cent level. Hence, there was no evidence 
to support a relationship in this group 
of routine clerks between job satisfaction 
scores and Strong clerical interest scores 
when Munroe Inspection scores were 
held constant. 

Strong stipulated that the criterion of 
validity for his test was not a high de- 
gwe of job satisfaction but “reasonable” 
satisfaction in contradistinction to in- 
difference and dissatisfaction (52). A 
threefold division of job satisfaction was 
made by assigning all scores of 18 and 
above to the satisfied group, scores of 14 
to 17 inclusive to mark indifference, 
while scores of 13 or below typified dis- . 
satisfaction. Following Strong's definition 
that all those with A and B+ scores were 
to be considered as possessing interests 
similar to those of the average successful 
office worker, the group was dichoto- 
mized into those achieving А and B+ 
scores and those who did not. Table IV 
shows the distribution and the chi-square 
value of 2.89, with 2 degrees of freedom. 
There was no evidence that vocational 
interest scores Were related to job satis- 
faction ratings. While there was a very 
slight tendency for numerical values indi- 
cative of vocational interest and job 
satisfaction to be correlated, when these 
measures were divided into meaningful 
categories no evidence for them to be 
related was found. Analysis of variance 
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TABLE IV а 


DISTRIBUTION OF THE STRONG CLERICAL INTEREST RATINGS MADE BY 100 
CLERKS ACCORDING TO THEIR Јов SATISFACTION RATINGS 


Job Satisfaction Ratings 


Clerical Interest Ratings 


Dissatisfied Indifferent Satisfied 
(o to 13) (14 to 17) (18 to 28) 
А and B+ 
(до апа оуег) 23 39 
В, B—, C+, and С i 
(39 and less) то 14 


Chi-square = 2.89 


2 degrees of freedom 


Not significant at the 596 level 


also disclosed that there was no evidence 
for a significant difference between the 
clerical interest scores of satisfied, in- 


different and dissatisfied routine clerks 
(Table V). 


Јов SATISFACTION AND RORSCHACH 
RESPONSES 

Job Satisfaction Scores and Davidson List 
of Adjustment Signs 

Each of the Rorschach protocols of 
the 100 subjects was scored according to 
the system utilized by Klopfer (53). The 
protocols were evaluated on the basis of 
16 of the 17 signs on the Davidson List 
(10). The sign, total number of Ror- 
schach responses equal 20, was elimi- 
nated because there was some evidence 
that the conditions of group Rorschach 
administration, followed in this study, 
might have militated against the produc- 
tion of 20 or more responses (24). The 
mean number of Davidson adjustment 


signs for the clerical group was 7.91, the 
standard deviation 2.31. This fell be- 
tween the mean of 6.85, standard devia- 
tion 2.27, made by scholastically unsuc- 
cessful boys and the mean of 9.05; 
standard deviation of 2.70 of successful 
boys (35). While the routine clerks were 
not too different from Margulies’ (35) 
total group of children, when compared 
with Davidson's original group of 102 
superior children (10), the routine clerks 
were shown to have fewer Rorschach 
signs of adjustment, 7.91 versus 10.31- 
Tests for the significance of the differ- 
ences between the means were not made 
because of the difference in age of the 
groups and the omission of one sign of 
the Davidson List. 

The coefficient of correlation between 
the job satisfaction scores and the total 
number of Davidson signs of Rorschach 
adjustment was —.16. As this coefficient 
was not significant, there was no evidence 


TABLE V 


ANALYSIS OF VARIANCE OF STRONG CLERICAL INTEREST SCORES OF тоо CLERKS DIVIDED 
ACCORDING TO SATISFIED, INDIFFERENT, AND DISSATISFIED Јов RATINGS 


Source d.f. Sum of Squares Mean Squares Р 
Between Means 2 497 248.5 
Within Groups 97 8907 91.5 2.7 


F is not significant at 5% level 
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of a relationship between self-evaluation 
of job satisfaction and th number of 
Davidson signs of adjustment. Satisfac- 
tion with routine clerical work and the 
Rorschach responses believed by David- 
son to be indicative of adjustment did 
not appear to be related. 


Job Satisfaction Scores and Munroe In- 
Spection Scores. 


9 The Munroe Inspection Technique 
total adjustment” score indicated the 
number and extent of the deviations of 
the Rorschach Test responses of a subject 
from those of the empirically established 
Rorschach norms. The higher the score, 
the more numerous the Rorschach re- 
Sponses considered indicative of malad- 
justment, The lower the score, the more 
normal. the performance of the subject 
on the Rorschach. The mean “total ad- 
justment” score of the clerks on the 
Munroe Inspection Technique was 
found to be 12.96 with a standard devia- 
tion of 4.40. Scores ranged from а low 
of 2 to a maximum of 23. The mode fell 
at two scores, 12 and 13 (Table VI). The 
median score was 13- Eighty college stu- 


dents studied by Munroe (39) earned an 
average score on the Munroe List of 
the same, and 


9.0, with the median about 
their range was 2 to 17- Roe found that 


TABLE VI 


DisTRIBUTION or MUNROE INSPECTIO 
NIQUE SCORES FOR 100 LERKS 


N TECH- 


Score Frequency 
2to 4 2 
8 
sto 7 
8 to то 18 
11 to 12 a 
13 tO I5 
16 to 18 16 
19 to 21 7 
Mean= 12.96 


Standard deviation =4-4 


the average Munroe Inspection score for 
a group of artists was 10.3, and that the 
range was 3 to 18 (46). Scientists averaged 
7.7 signs, with a range of 1 to 15, and 
technicians scored a mean of 9.4 signs 
with a range of 2 to 19 (47). It can be 
readily seen that the routine clerks now 
under study have consistently more Ror- 
schach signs of maladjustment than the 
other groups. These studies did not re- 
port sufficient data for tests of signifi- 


cance. 


It seemed desirable to learn if the Munroe 
Inspection Technique utilized by another psy- 
chologist would yield similar results when ap- 
plied to the Rorschach protocols. Following the 
scoring of the Rorschach protocols according to 
the customary Klopfer method, thirty of them 
were submitted to a clinical psychologist, ех- 
perienced in Rorschach examination (Miss C. 
Mildred Ceres). She independently rechecked 


tk writers scoring, making alterations and 
changes, when necessary, and, 


without knowl- 
edge of the writers Munroe Inspection Tech- 
nique scores, evaluated each of the protocols 
based on this technique. The above procedure 
was followed to appraise the reliability and 
similarity of the Munroe Inspection Technique 
scoring rather than the reliability and similarity 
of the general Rorschach scoring. The coeffi- 
cient of correlation between the independently 
computed “Munroe Inspection Technique scores 
was .94» with perfect agreement in go percent 
of the yecords. Thirty percent of the records 
differed by 1 entry, 33 percent of the records 
differed by 2 entries, and seven percent differed 
by 3. No score differed by more than three. The 
writer's own reliability coefficient was .96 for the 


first 20 records rescored after an interval of three 


months. 

The coefficient of correlation between 
the job satisfaction scores and the Mun- 
roe Inspection scores was calculated to 
be .02, which was not significant. The 
correlation coefficient between 
yo variables with Strong clerical 
interest scores held constant (—.06) was 
not significant. The Rorschach signs of 
maladjustment, as appraised by the Mun- 
roe Inspection Technique, were not re- 
Jated to job satisfaction scores. This 


partial 
these tv 
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TABLE VII 


ANALYSIS OF VARIANCE OF JOB SATISFACTION SCORES OF 100 CLERKS DIVIDED ACCORDING 
TO RELATIVELY FEW (o TO ro), MODERATE (11 TO 15), AND MANY (16 AND OvER) 
RORSCHACH SIGNS OF MALADJUSTMENT ON THE MUNROE INSPECTION List 


Source df Sum of Squares Mean Square F 
Among Means % 17.47 8.74 
Within Groups 97 1427.53 


14.72 0.59 


F is not significant at 5% level 


result confirmed the result reported 
above with the Davidson List of Adjust- 
ment signs. 

The Munroe Inspection Technique 
scores were divided into three categories, 
the scores falling between o and 10 indi- 
cating relatively few Rorschach signs of 
maladjustment, the scores between 11 
and 15 revealing relatively moderate Ror- 
Schach signs of maladjustment, and 
scores of 16 and over representing rel- 
tively many Rorschach signs of malad- 
justment. The corresponding job satis- 
faction scores were tabulated and the 
analysis of variance technique was ap- 
plied to ascertain if the scores were sig- 
nificantly different among the three cate- 
gories of Rorschach adjustment. No 
significant differences were discovered to 
exist among the job satisfaction scores 
of these three categories of Rorschach 
adjustment (Table VII). Tests of signifi- 
cance utilizing Chi-square also revealed 
no relationship between Munroe In- 


spection scores (Table VIII). Analysis of 
variance and chi-square procedures were 
utilized in addition to the Pearson pro- 
ductmoment correlation inasmuch as 
there might have been some doubt about 
the Munroe Inspection List as a continu- 
ous measure of Rorschach adjustment. 


Job Satisfaction Scores and Munroe In- 
spection List Categories 


The following description of results 
pertains to the separate categories on the 
Munroe Inspection List and their corre- 
lations with job satisfaction scores, In 
some cases, arbitrary weights have been 
assigned to specific entries [or each cate- 
gory used, in order to calculate product- 
moment correlations. For the biserial 
correlations, certain groupings of entries 
under each category were deemed ar- 
bitrarily right and others wrong, but only 
for purposes of calculation. Appendix B 
should be consulted for these arbitrary 
weightings and determinations. 


TABLE VIII 


Јов SATISFACTION SCORES or тоо CLERKS SHOWING RORSCHACH SIGNS 
OF MALADJUSTMENT ON THE MUNROE INSPECTION LisT 


Rorschach Signs of Maladjustment 


Job Satisfaction Scores 


Relatively Few 


Relatively Moderate Relatively Many 


(o to ro) (11 to 15) (16 and over) 
18 to 28 12 17 14 
14 to 17 то 13 10 
o to I3 6 5 3 


Chi-square = 3.26 


4 degrees of freedom 


Not significant at 5% level 
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| The coefficients of correlation between 
Job satisfaction scores and e Munroe 
Inspection List categories have been indi- 
cated in Table IX. No опе of these corre- 
lations was statistically significant. There 
was no evidence, then, for any relation- 


TABLE IX 


CORRELATION COEFFICIENTS BETW 
SATISFACTION Scores AND М 
Inspection List CATEGORY ENTRIES 


Pearson 


4 kaá 

Hen hama ie, | —- 
NI Catagories Correla- tions 

tions 

Color to Movement E —.07 
CF:FC ement E or 
FC .o6 .o3 
FM:M —.o6 —.II 
M Kn үч 
Total Color = E 
T otal Movement — —.13 
FZ Fc -— .05 
0 — .o1 
Range шоя "m 
opular — .05 
d ash EN 
W ES EA 


Ship between the individual Munroe In- 


Spection List categories and job satisfac- 
tion scores of routine clerks. 
ina- 


Job Satisfaction Scores and the Comb x 
vith 


tion of Munroe Inspection Scores d 
Str 1 ^Y res. 
Strong Vocational Interest Sco 


It was desirable to ascertain whether 
Certain combinations of Rorschach re- 
Sponses and vocational interests distin- 
8uish routine clerks who had satisfied 
Scores from those whose scores indicated 
indifference to or dissatisfaction with 
their work, There was no significant dif- 
ference in the number of routine clerks 
with Strong clerical interest scores of 40 
Ог above and Munroe Inspection scores 
of 16 or above who had job satisfactio" 
Scores of 18 or above, on the one hand, 
and those with the same combination of 


Strong and Munroe scores who had job 
satisfaction scores of 17 or below, on the 
other. There was no evidence that job 
dissatisfaction or indifference of routine 
clerks who possessed interests similar to 
those of successful office workers was as- 
sociated with Rorschach responses evalu- 
ated by the Munroe List as indicative of 
relatively many signs of maladjustment 
(Table X). 


VOCATIONAL INTERESTS AND RORSCHACH 
RESPONSES 


Vocational Interest Scores and the David- 
son List of Adjustment Signs. 


The correlation coefficient between the 
Strong office worker interest scores and 
the total number of Davidson signs of 
Rorschach adjustment was —.54, а sig- 
nificant correlation. The possession by 
routine clerks of vocational interests 
similar to those of successful office work- 
ers was associated with a low number of 
Rorschach signs of adjustment, as evalu- 
ated by the Davidson List. 


Vocational Interest Scores and the Mun- 
roe Inspection List 

The correlation coefficient between the 
Strong clerical interest scores and the 
Munroe Inspection scores was .33, which 


TABLE X 


Тов RATINGS OF 100 CLERKS WHO POSSESSED OR 
Dip Not Possess THE COMBINATION OF STRONG 
CLERICAL INTEREST SCORES OF до OR ABOVE 
AND MUNROE INSPECTION SCORES 

OF 16 OR ABOVE 


Dissatisfied 
Satisfied and 
(18andabove) Indifferent 
(17 and below) 


Pattern 12 то 


No Pattern 41 37 


Chi-square=-027 I degree of freedom 
Not significant at 5% level 
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TABLE XI 


q 
DISTRIBUTION OF STRONG CLERICAL INTEREST RATINGS MADE BY тоо CLERKS SHOWING 
RORSCHACH SIGNS OF MALADJUSTMENT ON THE MUNRO2 INSPECTION List 


Relatively Relatively Relatively 
Clerical Interest Ratings Few Moderate Many 
(o to 10) (11-15) (16 and over) 
A and В+ 
(40 and over) 13 34 22 
B, B—, C+, C 
(39 and less) 15 її 5 


Chi-square— 9.52 


2 degrees of freedom 


Significant at 1% level 


was significant at the 1 percent level. The 
partial correlation coefficient between the 
same two factors when job satisfaction 
scores was held constant was the same, 
-33, again significant. The possession of 
interests similar to those of successful 
clerks tended to be associated with Ror- 
schach responses evaluated by the Mun- 
roe List as indicative of many signs of 
maladjustment. This conclusion agreed 
with that reached in connection with the 
relationship between the Davidson List 
of signs of adjustment and the Strong 
clerical interest scores. The presence of 
the tendency for high clerical interest 
scores to be associated with high Munroe 
Inspection scores was also shown by the 
chi-square technique (Table XI). 


Strong Vocational Interest Scores and 
Munroe Inspection List Categories. 


The following description of resuits 
refers to the separate categories on the 
Munroe Inspection List and their corre- 
lations with the Strong clerical interest 
scores. For the arbitrary weights and the 
arbitrary assignment of right and wrong 
answers for purposes of computation, 
Appendix B should be studied. 

The coefficients of correlation between 
Strong clerical interest scores of routine 
clerks and the individual Munroe In- 


spection List categories have been indi- 
cated in Table XII. 

In the Klopfer scoring system, a re- 
sponse was scored as “FC” if definite 
form was fused with color, and a “Ch” 
score was assigned where the form was 
indefinite and the color important (28). 
The product moment correlation be 
tween the entries for the Munroe In- 
spection List category CF:FC and scores 
on the clerical interest scale of the Strong 
Blank was —.41 while the biserial corre 
lation coefficient was —.52. The differ- 
ences in correlation coefficients here and 
below, while not great, might be attri- 


TABLE XII 


CORRELATION COEFFICIENTS BETWEEN STRONG 
VOCATIONAL INTEREST SCORES AND MUNROE 
INSPECTION List CATEGORY ENTRIES 


Munroe Inspection Pearson EOR 
List Categories Product Biserial 
8 Moment 
Color to Movement .09 —.06 
СЕ:ЕС —.41 —.52 
EC „29, .40 
ЕМ:М —.40 —.80 
M E C +53 
Total Color — —«02 
Total Movement — —.09 
FK, Fc — лі 
F% — к E 
Range — 2 
Popular Responses = +23 
d -- 3108 
w — 414 
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buted to the probable lack 9t normality 
resulting from the arbitrary assignment 
of weights for the calculation of the 
product moment correlations. The 
higher the score on the Strong clerical 
interest scale, the greater was the 
tendency of routine clerical workers to 
give preponderantly indefinite form- 
color (CF) in preference to definite form- 
color (FC) responses to the Rorschach 
ink blots. 

The product moment correlation be- 
tween the entries for the Munroe In- 
spection List category FC and scores on 
the clerical interest scale of the Strong 
Blank was .27 which was significant at 
the 1% level. The biserial correlation 
was до. The higher the Strong clerical 
interest scores of routine workers the 
greater was the tendency for the less fre- 
quent use of definite form-color (FC) 
responses in the Rorschach records. 

A score of FM referred to the spon- 
taneous response by a subject in which 
he reported perceiving an animal in 
action or in a live posture, while M ap- 
plied to responses embodying the concept 
of a living being in human oF human- 
like action, The product moment corre- 
lation coefficient between the entries for 
the Munroe Inspection List category 
FM:M and scores on the clerical interest 
Scale of the Strong Blank was —.40 
Which was significant, while the biserial 
Coefficient was —.50. These correlations 
revealed that, for routine clerks, high 
E clerical interest scores tended to 

associated with a preponderant use 
of animal action (FM) responses over 
Muman action (M) responses. 
e Er product moment сопе, 
jos = between the entries for nape 
Scores Nspection List category ics ibi 
bon on the clerical interest sca er 

8 Blank was .55 while the biser 


elation co- 
he Mun- 


coefficient was .53. In this sample of rou- 
tine clerks, high scores on the clerical 
interest scale of the Strong Blank tended 
to be associated with infrequent use of 
human action (M) responses. 

There are ten popular responses in 
the Klopfer system, described in his 
manual (28). The biserial correlation 
coefficient between the entries for the 
Munroe Inspection List category popu- 
lar responses and scores on the clerical 
interest scale of the Strong Blank was .23, 
significant at 5% level. There was a slight 
but significant relationship between the 
lack of use of four or more popular re- 
sponses and the possession by routine 
clerks of interests similar to those of 
successful office workers. 


te 
INTERCORRELATIONS AMONG CERTAIN 
RORSCHACH FACTORS 


Davidson List of Adjustment Signs and 
the Munroe Inspection Technique. 

The Davidson List of signs of adjust- 
ment was introduced to ascertain the 
extent to which another objective 
method of assessing Rorschach adjust- 
ment would yield results similar to those 
of the Munroe Inspection Technique. 
The product moment correlation coeffi- 
cient for the two measures was —.764, 
demonstrating a high relationship. А 
high Munroe score (many Rorschach re- 
sponses considered signs of maladjust- 
ment) strongly tended to be associated 
with a low Davidson score, representing 
the absence of Rorschach signs of mal- 
adjustment. The correlation coefficients 
of each measure with job satisfaction and 
vocational interest scores tended to agree. 


The Munroe Inspection List Categories. 

For several of the categories of the Munroe 
Inspection List which correlated significantly 
with the Strong clerical interest scores, intercorre- 
lations were calculated, using the product-moment 
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and tetrachoric methods, The arbitrary weights 
and assignments of right and wrong answers 
are shown in Appendix B. These intercorrelations 
were essential for arriving at a multiple correla- 
tion coefficient for predicting Strong clerical in- 
terest scores from a combination of five Rorschach 
factors (Table XIII) The significant relation- 
ships shown in Table XIII indicated that: 

1. The greater the preponderance of indefinite 
form-color (CF) responses to definite form-color 
(FC) responses, the smaller was the number of 
definite form-color (FC) responses of routine 
clerks. 

2. The greater the predominance of indefinite 
form-color (CF) responses over definite form- 
color (FC) responses, the more infrequent was 
the use of human action (M) responses by rou- 
tine clerks. 

3. The greater the predominance of animal 
action (FM) responses to human action (M) re- 
sponses, the smaller was the number of human 
action (M) responses. 

4. The smaller the ratio of animal action (FM) 
to human action (M) responses, the greater was 
the tendency for routine clerks to give four or 
more popular responses in their Rorschach -ec- 
ord. 

5. The smaller the number of human action 
(M) responses, the smaller was the tendency for 
routine clerks to give four or more popular 
responses in their Rorschach records. 


MULTIPLE CORRELATION BETWEEN STRONG 
VOCATIONAL INTEREST SCORES AND 
MUNROE INSPECTION List 
CATEGORIES 

Utilizing the following five categories 
on the Munroe Inspection List, CF:FC, 
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FM:M, FC, M, and Populars, a multiple 
correlation coefficient of .72 with Strong 
Clerical Interest Scores was calculated 
from their tetrachoric correlations. The 
iterative technique, described in an Army 
Air Forces research publication edited 
by R. L. Thorndike (56), was utilized 
for predicting Strong Interest Scores. 
This correlation, corrected for shrinkage, 
was .70. 


RORSCHACH RESPONSES OF Routine CLERKS 


The average number of the total Rorschach 
responses of the routine clerks was 21.4, with a 
standard deviation of 8.7. Lindner's group ot 
100 prison inmates averaged 20.4 responses (33): 
Hertzman's 100 college students had 25.1 Te- 
sponses as their mean (24). The mean number of 
responses for scientists and technicians was 21.0, 
and 18.2 respectively (47). An unselected group 
of 34 adults, age range 26 years to 58 years, 
averaged 30.9 responses (19). 

The percentages of the responses distributed 
among the Rorschach location categories (W, D, 
d, Dd, and S) for the 100 routine clerks, and 
the 34 unselected adults tested by Harrower 
Erickson and Steiner (19) are shown in Table 
XIV. The definitions and significance of these 
Rorschach responses are in Klopfer's manual 
(28). No significant differences were found be- 
tween the percentages of Rorschach location re- 
Sponses used by each group. 

The percentages of the responses distributed 
among the Rorschach determinant responses 
(M, FM, m, k, К, FK, F, Ес, c, Œ, FC, CF, ©) 
for the 100 routine clerks and the 34 unselected 


TABLE XIII 
CORRELATION COEFFICIENTS BETWEEN MUNROE Inspection LIST CATEGORIES* 


CF:FC FM:M 


FC M Popular 
Responses 
CF:FPG .07 —.31ї —.32 = 
.02 —.54 —.36 —.0ў 
ЕМ:М +00 ا‎ э 
4 —.80 —.22 
ЕС .o6 = 
—.оз 14 
M ens 
.23 


* The italicized coefficients were com 
ated by the product moment method. 


puted by the tetrachoric method while the others were calcu- 
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TABLE XIV 


RORSCHACH Location 


RESPONSES or Two GROUPS OF ADULTS 


Yates: Harrower-Erickson 

Location Respo uc 51 Unsclectod 2 

sponses егк5 Adults == 

oD 

Percentage Percentage 

w 42.1 31.8 1.1 

D 44.5 48.1 0.4 

d 4.7 12.8 1:5 

Dd 7.6 5-1 о.5 

S 0:3 2.4 o.8 
adults are also shown (Table XV). The defini. The routine clerks tended to utilize significantly 
deter- more animal action (FM) responses than the 


ы and significance of these Rorschach г 
EISE responses can be found in Klopfer's 
in a (28). The only significant difference was 
Spon percentage of animal action (FM) re- 
eat es used by each group. The other per- 

ages did not reveal significant differences. 


other adults. This difference might be due to 
the dissimilarity in the inquiry procedures. In- 
dividual inquiries were conducted with the 
clerical group while the unselected adults re- 


ceived a group inquiry. 


TABLE XV 


ses ó Two GROUPS OF ADULTS 


н DETERMINANT RESPON: 


RORSCHAC! 
z Harrower-Erickson 
4 Kates: and Steiner: Ü 
Determinant roo Routine 34 Unselected 
Responses Clerks Adults 
L —— — ср 
Percentage Percentage 
M 12.2 9.1 o 
FM 25.3 F £ $ 
m fo e 0.1 
k 1.6 . о.2 
H 2:8 27 а 
m e : | 
БК a 54.3 1.9 
ë É 33.7 T4 1-9 
Её 4.0 ve oe 
с, 1.3 e ie 
F Be 7.0 0.6 
CF $5 6.5 o.8 
Ç о.о о.2 0.9 


CHAPTER V Ç 


INTERPRETATION OF RESULTS 


JOB SATISFACTION 


HE degree of dissatisfaction ex- 
je by these routine clerical 
workers is noteworthy when contrasted 
with typical results for adults in general 
(25). Many of the clerical workers were 
barely satisfied with their work while a 
goodly proportion were either dissatis- 
fied or indifferent. Comparison with 
other investigations (42, 64) indicates 
that this amount of job dissatisfaction is 
fairly typical of clerks. Wrenn studied 
the job satisfaction of college graduates 
who were engaged in many occupations 
(64). He discovered that 19 percent were 
dissatisfied with their jobs in contras: to 
the 52 percent of the clerical workers in 
his group who were dissatisfied (64). In 
another survey, two clerical groups num- 
bering 117 and 102, were satisfied in the 
proportion of 50 percent and 51 percent 
respectively (42). The results obtained in 
the writer's study were strikingly similar 
to the job satisfaction results described 
above, 


Using the same scale and measuring satisfac- 
tion with nursing Nahm ascertained that the 
mean satisfaction score of 428 subjects was 21.8 
and that the standard deviation was 2.86 (41). 
A heterogeneous sampling of 309 subjects, repre- 
sentative of a small community, averaged 20.01 
points (25). (This mean satisfaction score was 
calculated from two frequency distributions.) 
The mean satisfaction score of 154 engineering 
students was 21.06, standard deviation 2.98 (3). 
The mean satisfaction score of the 100 routine 
clerks under present investigation appears to be 
significantly below that reported in the above 
investigations. In addition, the percentage of 
satisfied clerks found in this investigation is 
lower than that found in many other investiga- 
tions (17, 18, 25). 

Workers on higher occupational levels were 
found to be more satisfied than those on the 
lower levels (25, 42, 53). It may be pre- 
sumed that one of the causes for this compara- 
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tive lack of work satisfaction was the relatively 
low occupational status of government clerical 
workers. Associated with this feature were their 
lower earnings. Probably the higher mean score 
of the 17 clerks in Hoppock's study (25), that of 
21.52, was due to the difference in economic 
conditions. This last study was conducted dur- 
ing the depression years, 1932-1933. It may һе 
that during depression years workers are happy 
to hold any job, while in prosperous years they 
аге more dissatisfied when they eye the more 
lucrative. positions. Perhaps the vocational as- 
piration levels of these clerks were pegged 50 
high as a consequence of the urge to rise in our 
society that dissatisfaction with what might ac- 
tually be suitable work ensued unjustifiably. 


The type of work demanded of these 
clerks affords another clue to their dis- 
satisfaction. The work, which was rou- 
tine, did not demand much individual 
judgment or initiative, afforded a dearth 
of social contacts, and required, above 
all, the ability to take direction. Dodge 
had shown that the successful routine 
clerical workers displayed these charac 
teristics. Assuming that the duties of the 
work did not present any difficulties to 
the clerks in the matter of personality 
requirements, then it is surprising that 
these clerks did show so much dissatis- 
faction. Either these clerks’ personality 
characteristics were at variance with 
those demanded by the job, or having 
personality traits characteristic of rou- 
tine clerks is not of cardinal importance 
for the existence of a high degree of job 
satisfaction. 

There is no evidence to support a re 
lationship between the number of years 
of schooling and the degree of job satis- 
faction. The number of years of school- 
ing has been used by Wesman to validate 
his Personnel Classification Test (62). It 
has been shown by Pond and Bills that 
years of schooling correlated ‚Бо with in- 
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telligence test scores of clerks and other 
Workers (48). Due to the broadening of 
educational opportunity, there is proba- 
bly greater correspondence now between 
Years of schooling and intelligence than 
when Pond and Bills completed their 
Study. Young reported that workers of 
Superior intelligence might be bored by 
repetitious tasks but on the other hand, 


.they were often able to arrange their 


Work so that it afforded them relief from 
monotony (65). Assuming that the num- 
ber of years of schooling was a rough 
measure of intelligence, the inference 
might then be made that intelligence and 
job satisfaction of clerical workers are 
not related. This inference might be ap- 
plied to all clerical workers utilizing as 
evidence Uhrbrock's (57) results and the 
above mentioned studies 


Jon SATISFACTION AND VOCATIONAL 


There seemed to be slight evidence 
that, in routine clerical workers at least, 
job satisfaction scores were positively 
related to clerical interest scores on the 
Strong Blank. The product moment 
Correlation (+.21) was significant at the 
five percent level. Breaking the scores 
into meaningful categories as suggested 
by Strong failed to reveal any evidence of 
а relationship between letter ratings Or 
numerical scores on the Strong clerical 
interest scale and being satisfied with, 
indifferent to, and dissatisfied with rou- 
tine clerical work. The lack of demon- 
Strated significant relationship between 
job satisfaction and vocational interests 
may be a function of the lowered sensi- 
tivity of one or both of the tests. Ap- 
parently, the clerical scale of the Strong 
Vocational Interest Blank was of slight 
Value in distinguishing those who were 
content with their work. 


Two additional observations can be made 
with regard to job satisfaction and vocational 
interests: the measure utilized in evaluating job 
satisfaction may have been faulty, and the dif- 
ferences between Strong's criterion group and 
the 100 routine clerks may have contributed to 
a lack of a demonstrated relationship. Hoppock 
(25) and Strong (52) both stated that more satis- 
factory measures of job satisfaction need to be 
constructed. In view of the fact that most work- 
ers reacted to their situations as a whole, it is 
felt that the blank utilized was more adequate 
than many others. It was of importance for this 
study to obtain a self-evaluation of satisfaction 
which this blank again provided. The self-eval- 
uation of work contentment could be influenced 
by many factors extraneous to the work situa- 
tion, as Kornhauser indicated (29). Some would 
not be adept at estimating how they actually 
felt about a job or would not translate their 
feelings adequately when the job satisfaction 
questionnaire was laid before them. Some de- 
gree of self-deception or rationalization could be 
operative. While the workers might like clerical 
work in general, the conditions peculiar to their 
current employment might so color their think- 
ing that they ‘would evaluate themselves as dis- 
satisfied. Hence, it is conceivable that individuals 
might receive A or B+ ratings on the clerical 
scale of the Strong Blank, might like routine 
clerical work greatly, and yet might express in- 
difference to or dissatisfaction with their job. 

It must be recalled that the group upon which 
Strong standardized his office worker scale was 
more heterogeneous than this group and earned 
significantly different scores and ratings on the 
clerical scale. Approximately one-third of 
Strong's group were clerks who could be termed 
administrators having little in common with 
routine clerical workers. In addition, the groups 
were not strictly comparable due to the dif- 
ferences in geographic origin, with Strong's 
group coming from many different parts of the 
country. In the group of 100 routine clerks, 
there might have been many who were engaged 
in clerical work because of economic circum- 
stances despite lack of interests similar to those 
of clerical workers. Moreover, the possibility 
exists that the Strong office worker scale was not 
particularly suitable for routine clerks inasmuch 
as Strong attempted to establish norms for all 
types of office workers, including routine clerks. 
On the other hand, these significant differences 
should not be construed as impugning the valid- 
ity of the clerical scale of the Strong Blank. 
These differences support the conclusion that 
office workers who originate from one geo- 
graphic area and who carry on routine work 
differ significantly in their vocational interests 
from Strong’s criterion group, represented by . 


22 SOLIS L. KATES 


workers throughout the United States and dis- 
tributed among all types of clerical workers. 
Strong has objected to the use of different norms 
for separate groups within one occupation (52). 
He pointed out that his norms have been based 
upon а тоге representative occupational sam- 
pling, usually far more numerous. In treating the 
subjects' scores on the clerical scale, the point 
of reference (that of similarity of interests to 
those of successful office workers, including ad- 
ministrative clerical workers from all parts of 
the country) must be kept in mind. The thought 
cannot be lightly dismissed that an occupational 
scale based on routine clerks as the criterion 
group might so reshuffle the scores on the Strong 
Blank that greater correspondence would be ex- 
hibited between these scores and job satisfaction. 
This type of scale would require a population 
representative of routine clerks throughout the 
country and would form the basis of another 
study especially if carried out coincidentally 
with personality and aptitude tests. 

Certain other facts may help in understanding 
why vocational interest scores were of slight 
value in distinguishing the more satisfied clerks 
from the less satisfied ones. Workers who ; 9s- 
sessed mechanical skills have been found to be 
more interested in the techniques, procedures, 
and products of their work than were unskilled 
workers, who gave predominant importance to 
the working environment (13). Roethlisberger 
and Dickson have pointed out, too, that un- 
skilled workers were more attentive to work 
conditions than supervisors, the latter being 
more interested in what their work duties de- 
manded than in their environmental conditions 
(48). Where personal talents and skills are chal- 
lenged or demanded by a job, it appears likely 
that correspondence of interests to those of 
other successful workers in the occupation would 
be significantly associated with job satisfaction. 
The satisfaction of achieving something difficult, 
of expressing one's skills and abilities, and of 
being able to deal with tasks successfully and 
constructively has been related to the develop- 
ment of vocational interests. Carter has reasoned 
that the individual attempted to identify him- 
self with specific occupational groups whose ac- 
livities he could successfully handle (7). Bordin, 
essentially agreeing with Carter, has stated that 
the individual conceived of himself in terms of 
occupational stereotypes whose requirements he 
satisfyingly and successfully fulfilled and that 
a change in this self-concept resulted in a change 
of vocational interests (5). It is possible, then, 
that the professional and mechanical occupa- 
tions would reveal significant relationships be- 
tween the possession of the related vocational 
interests and job satisfaction. Further research 
is desirable, in this matter, to ascertain if such 


a relationship exists between vocational interests 
and job satisfaction and to throw light upon 
this hypothesis. Nov study, with the possible €x- 
ception of Lorimer's (34) has yet validated this 
plausible and logical relationship. But for an 
occupation such as routine clerical work, the 
nature of the work was not challenging and little 
opportunity existed, generally speaking, to 
achieve something difficult. Not too much corre- 
spondence between interests similar to those of 
office workers and job satisfaction should exist. 
The intrinsic nature of clerical work contributed 
relatively little to job satisfaction, when viewed 
according to the aforementioned standpoint. 

It is vital, now, to examine what practical 
criterion of validity Strong used for his test. He 
stated: 


Because we had no adequate measure of satis- 
faction in their work of the men in the follow- 
up studies to be reported below, we have been 
forced to content ourselves with assuming that 
“continuance in an occupation” is a measure О 
“satisfactory adjustment” (52, p. 387). 


Using this criterion, continuance in an occupa- 
tion, Strong succeeded in validating his blank. 
Strong declared, however, that continuance 1n 
an occupation could not be accepted as the full 
equivalent of job satisfaction because many dis- 
satisfied workers were not free to leave their 
employment. It may perhaps be found that the 
clerical scale of the Strong Blank is more suc- 
cessful in predicting those who will continue in 
clerical work rather than those who will be 
satisfied with the occupation. This last state- 
ment has not been demonstrated in this study, 
but these results combined with Strong’s, make 
it a reasonable conclusion. It is essentially а 
more justifiable conclusion than stating that job 
satisfaction may be predicted from vocational 
interest scores, which this study has found to 
be of questionable accuracy. It can probably be 
stated that all Strong interest scores could pre- 
dict continuance in an occupation, with Strong's 
own data as proof. It is unwarranted to say that 
men will be satisfied with their jobs if their in- 
terests are similar to those of men successfully 
engaged in such work. This hypothesis still re- 
mains unproved even for the professions, despite 
its plausibility. As a consequence of the results 
contained herein, and a reexamination of 
Strong's work, the utility of the Strong Blank 
may be defined more precisely as indicative of 
an individual's continuance in an occupation. 
Routine clerical workers with high scores on 
the Strong Blank may have been attracted to 
their work as a result of those interests. How- 
ever, for routine clerical jobs, possession of in- 
terests similar to those of successful office work- 
ers, as has been seen, was not an important 
factor contributing to job satisfaction. A whole 
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complex of factors, in and away, from the job, 
of which vocational interest is “only one may 
have combined to produce? satisfaction. Thus 
while clerical work may have attracted indi- 
viduals who had high clerical scores on the 
Strong Blank, it did not insure that job satis- 
faction would follow. 

Another possibility must be taken into ac- 
count. Some of the clerks with high scores on 
the Strong clerical scale may have had primary 
vocational interests in other vocational аге 
The presence of competing primary interests 
may have caused a diminution in the degree of 
job satisfaction experienced by clerical workers 
who had high Strong clerical interest scores. 
These competing interests may not have found 
fulfillment in what the clerks were working at 
and may have caused a lowering of job satisfac- 
tion scores, Research is necessary to ascertain 
Whether two compcting and apparently equally 
Strong interests prevent the gaining of satisfac- 
tion from the pursuit of one of these occupa- 
tions. 

Increasing importance has been assigned to 
vocational interest patterns rather than scores 
on single scales. Strong and Darley suggested 
that guidance should not rely upon single in- 
terest scale scores, but should have as its basis 
scores earned within separate interest patterns. 
Possibly if the Strong Vocational Interest Blanks 
had been scored according to interest patterns, 
there might have been more correspondence 
With job satisfaction scores. This possibility 
should be investigated for its practical import 
to guidance and for determining the compara- 
tive value of single scales as compared with in- 
terest patterns. . 

RonscHACH ADJUSTMENT 

The evaluation of the clerks’ 
Schach responses using the Davidson 
signs resulted in mean scores that perm! 
conflicting interpretations. When com- 
pared with Davidson's original group 
(10), there is a wide difference In the 
number of adjustment signs, while the 


Ror- 


groups that Margulies (35) tested do mot 
appear to be much different. The ees 
the 


Nificance of differences between 
Sroups was not tested because both 
Margulies’ and Davidson's groups were 
Composed of children and because only 
16 of the 17 Davidson signs were utilized 
In evaluating the Rorschach responses O 
the clerks. 


The Munroe Inspection Technique 
seems to demonstrate clearly that the 
clerks exhibited more Rorschach signs 
of maladjustment when compared with 
college students, artists, scientists, and 
technicians. The Munroe Technique pre- 
sumably affords a fairer appraisal than 
the Davidson List inasmuch as the com- 
pared groups were composed of adults. 
When so considered, the results agree 
with those of Dodge (11) who found that 
the successful routine clerks considered 
themselves more ill at ease in social con- 
tacts, less willing to accept responsibility, 
and more dependent upon others. Ac- 
cording to standard Rorschach interpre- 
tation these traits would be among those 
contributing to scores for maladjustment. 
While the clerks had many Rorschach 
sigas of maladjustment, it is possible 
that these traits were ones that aided 
them in adapting to their work. With- 
out these traits, it is conceivable that 
these routine clerks might have experi- 
enced difficulty in accepting the condi- 
tions of their work. In his study of 142 
psychoneurotic ex-soldiers, Pratt revealed 
that there was a definite tendency to 
enter clerical work after having failed to 
adjust to other types of employment (44). 
Clerical work was also reported to have 
helped rclieve their symptoms (44). 
Another suggested possibility is that cleri- 
cal work is a refuge of individuals with 
certain characteristics. These deductions 
would apply to routine clerical workers 
and would not be descriptive of ad- 
ministrative clerks, or clerks whose jobs 
entail extensive public contact. 


Јов SATISFACTION AND RORSCHACH 
ADJUSTMENT 


There was no apparent relationship 
between the degree of job satisfaction 
and scores оп either of the objective 
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son List is not as extensive as the Munroc 
Technique nor are there any such fine 
distinctions in the weightings assigned 
to differences in quality and quantity of 
Rorschach responses. 


PREDICTION OF CLERICAL INTERESTS FROM 
MUNROE INSPECTION List CATEGORIES 


The multiple correlation was .72 be- 
tween clerical interest scores on the 
Strong Blank and the following five 
Rorschach categories on the Munroe In- 
spection Technique: CF:FC, FM:M, 
FC, M, and Popular, This represented 
the correlation between the Strong cleri- 
cal interest scores and the best estimates 
of these Strong scores made from a linear 
combination of the aforementioned five 
Munroe List categories. The multiple 
correlation coefficient was -70 after tor- 
rection for shrinkage. Knowing the 
entries made for these five Munroe List 
categories, it would be possible with fair 
accuracy to predict Strong clerical inter- 
est scores of routine clerks. Forty-nine 
percent of the variance of the vocational 
interest scores of routine clerks could be 
attributed to the joint action of these 
five Munroe List categories. The remain- 
ing 51 percent of what made the routine 
clerks differ in vocational interest Scores 
could be attributed to factors other than 
these five Munroe List categories. 


VOCATIONAL INTERESTS AND RoRscHACH 


RESPONSES: THEORETICAL IMPLICATIONS 


Routine clerks who showed interests 
similar to those of successful office work- 
ers were more likely than others to reveal 
what Rorschach examiners would call 
severe personality disturbances. The 
higher the scores on the clerical scale of 
the Strong Blank, the more numerous 
were the signs of personality maladjust- 
ment in their Rorschach responses. 


KATES 


According, to standard Rorschach in- 
terpretation, routine clerks who had 
interests most similar to those of success- 
ful office workers were prone: 


1. To impulsive emotional reactions to stimuli 
(the interpretation of the predominance of in- 
definite form-color [CF] over definite form color 
[FC] responses). These clerks had slight conscious 
control over their emotional reactions to stimuli. 
Their activities seemed largely determined and 
provoked by immediate environmental stimuli. 

2. To show an inadequate degree of emotional 
adjustment to outer reality (the interpretation 
of an infrequent use of definite form-color [FC] 
responses). These cle apparently, were not 
able to make adequate social contacts with their 
environment. 

3. To be emotionally immature (the interpre- 
tation of the predominance of animal action 
[FM] over human action [M] responses). They 
were prone to be dependent individuals whose 
inner strivings were mainly on an immature 
level. 

4. To be unable to accept their own ideas and 
creations (the interpretation of the infrequent 
use of human action [M] responses). They did 
not seem to take their own outlook and ideas 
seriously. 

5. To show a slight incapacity to think along 
conventional lines (the interpretation of the in- 
frequent. use of popular responses). These sub- 
jects appeared to show a slight inability to in- 
dulge in thinking common to most people. 


The section on Rorschach interpreta- 
tions has been offered with much hesita- 
tion, in view of the many unsubstantl- 
ated interpretations and varying mean- 
ings assigned to the Rorschach scoring 
symbols by some Rorschach examiners. 
It has been included only to describe the 
Rorschach characteristics of routine 
clerks and the associations of these Ror- 
schach responses with vocational interests 
and job satisfaction in terms of the more 
commonly accepted interpretations of 
Rorschach examiners. 


FURTHER RESEARCH SUGGESTIONS 


The relation between the lack of primary in- 
terest patterns and job satisfaction should be as- 
certained. The relation between the lack of 
primary interest patterns and personality trails 


>. 
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such as those the Munroe Inspection List was 
intended to assess should be invitigated. The 
three nonoccupational scales of the Strong Vo- 
cational Interest Blank should also be related 
to measures of job satisfaction and personality. 

It would be worthwhile to score a group of 
life insurance salesmen on their own occupa- 
lional scale and upon scales such as those for 
office workers or mathematicians, and to relate 
these scores to the Rorschach categories of the 
Munroe Inspection List. Mathematicians and life 
insurance salesmen seem to represent opposite 
extremes in their social predilections. Will such 
tendencies be reflected in the Rorschach Test 
results and the degree of association of Munroe 
Inspection List categories to interest scale scores? 

А developmental or cross-sectional study of 
young men at ages 17, 19, 21 and 23 using the 
Rorschach Test and the Strong Vocational In- 


terest Blank would be very worthwhile. Changes 
in scores on the appropriate interest scales in 
relation to changes in the several categories of 
the Munroe Inspection List would help in clari- 
fying the relation of personality patterns to 
vocational interests. 

Another suggestion would envisage the testing 
with the Rorschach Test and the Strong Blank 
of students at а business school, clerks who had 
been on the job between six months and one 
year, and clerks who had been on the job 5 
years or more. Would these groups show pro- 
gressively higher mean scores on the Strong 
clerical scale? It would be worthwhile to ascer- 
tain whether certain Rorschach factors, appar- 
ently associated with the possession of interests 
similar to those of successful office workers, be- 
came more dominant in clerks with greater 


clerical job experience. 


СНАРТЕК VI 


1 
SUMMARY AND CONCLUSIONS 


Problem 


HE aim of this study was to investi- 

gate the vocational interests, Ror- 
Schach responses, and job satisfaction of 
a group of 100 men employed in routine 
clerical work by the Federal government. 
The interrelationships among such fac- 
tors as job satisfaction, vocational inter- 
ests, and Rorschach signs of maladjust- 
ment were sought. Those Rorschach 
responses which were related significantly 
to job satisfaction and vocational inter- 
ests were isolated. The subjects were 
chosen so that they fulfilled the following 
conditions: (а) permanence and stability 
of employment; (b) similarity of occupa- 
tion; (c) presumably similar general ma- 
terial working conditions; (d) restricted 
salary range; (е) employment in a voca- 
tion for which a scoring scale had been 
established on the Strong Vocational In- 
terest Blank. 

It was desired, through this study, to 
ascertain if the following hypotheses were 
tenable: 

1. The job satisfaction of routine office 
clerks bears a significant positive rela- 
tionship to the possession of interests 
similar to those of successful office work- 
ers as measured by the Strong Vocational 
Interest Blank. 

2. The job satisfaction of routine 
clerks is not associated with the number 
of Rorschach signs of maladjustment. 

8. The work dissatisfaction of routine 
clerks who possess the interests of success- 
ful office workers is not related to the 
number of Rorschach signs of maladjust- 
ment. 

4. Certain Rorschach responses shown 
in the Rorschach records of these clerks 
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are significantly associated with the pos- 
session of vocational interests held by 
successful office workers. 


Procedure 


The 100 routine clerks completed the 
job satisfaction blank, the Strong Voca- 
tional Interest Blank, and the Rorschach 
Test. The administration of the tests to 
the subjects took place in groups of four 
or five. Anonymity was guaranteed to all. 
Standard procedures were followed with 
the job satisfaction blank and the Strong 
Vocational Interest Blank. As for the 
Rorschach Test, the clerks ‘after being 
given sets of cards, wrote their responses 
in special booklets, and separate indi- 
vidual inquiries were conducted for all 
the protocols. 

The Strong Blank was scored with the 
office worker scale devised by Strong: 
Scores were computed for the job satis- 
faction blank. The Rorschach protocols 
were evaluated by two techniques, the 
Munroe Inspection Technique and the 
Davidson List of Signs. Numerical scores 
were obtained for each list, indicative of 
the degree of Rorschach maladjustment 
and adjustment. The relationships be 
tween each of the separate measures were 
obtained through product-moment corre- 
lations, biserial correlations, partial 
correlations, and analysis of variance and 
chi-square methods, In addition, several 
of the categories in the Munroe Inspec 
tion Technique were related to job satis- 
faction and to Strong clerical interest 
scores. Correlations among five of the 
categories on the Munroe Inspection 
Technique were computed by product 
moment and tetrachoric methods. FF 
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nally, a multiple correlation; coefficient 
was calculated for the prediction of the 
Strong clerical interest scores of routine 
office clerks from five categories of the 
Munroe Inspection List. 


Findings 

The first hypothesis was not supported 
by the results of this investigation. The 
job satisfaction of routine clerks ap- 
peared to have no significant relationship 
to the scores and letter ratings earned 
on the clerical scale of the Strong Voca- 
tional Interest Blank. The differences 
between the criterion group used by 
Strong in standardizing the clerical inter- 
est scale and the routine clerks were 
pointed out. It was suggested that ifa 
Vocational interest scale based upon rou- 
tine clerical workers had been utilized, 
greater association between the degree of 
job satisfaction and the possession of the 
corresponding vocational interests might 
have been demonstrated. Moreover, 
other reasons were advanced to support 
the concept that satisfaction with routine 
clerical work might not be associated 
With the possession of the interests of 
Successful office workers. 'The nature of 
routine clerical work and the relative 
lack of demand that it makes on the 
Skills and talents of the incumbents were 
underscored. A close relationship be- 
tween job satisfaction and vocational 
interests was posited for professional and 
Skilled mechanical workers in view of 
the relative satisfyingness of achieving 
Something creative or difficult. Little 
such opportunity exists for routine 
clerks, The presence of competing VO- 
cational interests may have caused а 
diminution in the association between 
job satisfaction and vocational interests 


ОЁ routine clerks but was not investi- 


gated. Vocational interest pattern scores, 
embracing a family of related occupa- 
tions, may have shown greater relation- 
ship to job satisfaction than single inter- 
est scale scores, but this possibility also 
was not investigated. Attention was di- 
rected to the fact that while vocational 
interests may have attracted some of 
these routine clerks to their work, other 
factors may -be more important in the 
occupational choice and job satisfaction 
of clerical workers than vocational inter- 
ests. 

The second hypothesis was confirmed: 
there was no relationship between job 
satisfaction and the number of Ror- 
schach signs of maladjustment for rou- 
tine clerks. Routine clerks may be satis- 
fied with their work despite many 
Rorschach signs of maladjustment. Ap- 
parently whatever is revealed by many 
Rorschach signs of maladjustment did 
not prevent routine clerks from deriving 
satisfaction from their work. The validity 
of the Rorschach Test was questioned 
because it failed to distinguish between 
the satisfied on the one hand and the 
indifferent and dissatisfied on the other 
hand. An equally credible alternative was 
that the reasons for work satisfaction or 
dissatisfaction in office clerks should not 
be sought in the psychological context, 
i.e., in responses to the Strong Vocational 
Interest Blank and the Rorschach Test, 
but rather in the social situation, as sug- 
gested by recent intensive investigations. 

The third hypothesis was sustained: 
the job dissatisfaction or indifference of 
routine clerks who possessed the inter- 
ests of successful office workers was not 
ated with a greater number of Ror- 


associ 
schach signs of maladjustment. The 
presence of many Rorschach signs of mal- 


adjustment did not detract from or add 
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to the job satisfaction of routine clerical 
workers who have the interests of suc- 
cessful office workers. 

The fourth hypothesis was supported 
by the results. There were five Rorschach 
response categories significantly associ- 
ated with the possession of vocational 
interests by routine clerks similar to 
those of successful office workers. 

These five Rorschach signs of malad- 
justment that were correlated signifi- 
cantly with the possession of vocational 
interests similar to those of successful 
office workers included: (a) the use of 
indefinite form-color (CF) responses in 
preference to definite form fused with 
color (FC) responses; (b) a slight tend- 
ency to use an insufficient number of 
definite form-color (FC) responses; (c) a 
proneness to use predominantly iñore 
animal action (FM) than human action 
(M) responses; (d) relatively few human 
action (M) responses; (e) slightly but sig- 
nificantly infrequent use of popular re- 
sponses. 

The Rorschach characteristics of rou- 
tine clerks appeared to be quite similar 
to those of the group with which they 
were contrasted except for the clerks' 


greater use of animal action (FM) re 
sponses. This characteristic was indi- 
cated as a Rorschach sign of maladjust- 
ment that the clerical group as a whole 
manifested. 

The two measures of Rorschach ad- 
justment, the Davidson List of Adjust- 
ment Signs and the Munroe Inspection 
Technique, were shown to be highly 
correlated. It was concluded that they 
were both measuring a similar pattern 
or similar patterns. The extent to which 
the two lists arrive at different evalua- 
tions of adjustment is a function of their 
two major differences: the Davidson List 
is not as extensive, and does not include 
such fine distinctions in quality and 
quantity of the Rorschach responses. 

Intercorrelations were obtained be- 
tween several of the Rorschach categories 
on the Munroe Inspection List, and a 
multiple correlation coefficient of .7o was 
computed for the prediction of Strong 
clerical interest scores from these several 
categories. 

Further supplementary areas of re- 
search were delineated and several hy- 
potheses were discussed which still re 
quire substantiation. 


APPENDIX А 


JOB SATISFACTION BLANK 
(After Hoppock, 43) 


NUMBER _ 


You are asked to help in a scientific 
study by answering the questions in this 
blank. Neither your employer nor any 
of your associates will be allowed to see 
your answers. Your replies will be added 
to those of many other people, and only 
the group totals will be published. Your 
answers will be worthless unless they are 
perfectly frank and truthful. 

Choose the ONE of the following state- 
ments which best tells how well you like 
your job. Place a mark (X) in front of 
that statement. 


1. — .. hate it. 

2. ү dislike it very much. 

3. —— I don't like it. 

4. —— I am indifferent to it. 

5. — L like it. 

6... I am enthusiastic about it. 

7. р like it more than I could any 
other job. 


Mark one of the following to show 
how you think you compare with other 
people. 

7- — No one like 
Ilike mine. 


s his job better than 


6. I like my job much better than 
most people like theirs. 
5. — I like my job better than most 


people like theirs. 
4. — I like my job about as well as 
most people like theirs. 
3. — I dislike my job more than 
people like theirs. 


most 
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о, ___I dislike my job much more than 
most people dislike theirs. 
1. — No one dislikes his job more than 


I dislike mine. 

Mark the one of the following which 
best tells how you feel about changing 
your job. 

1. I would quit the job at once if I 
could get anything else to do. 

a. I would take almost апу other 
job in which I could earn as 
much as I am earning now. 


8. ——I would like to change both my 
Р job and my occupation. 
4. — 1 would like to exchange my pres- 


ent job for another job in the 
same line of work. 

5. — I am not eager to change my job. 

but would do so if I could get a 

better job. 

6. — I cannot think of any jobs for 
which I would exchange mine. 

. ..] would not exchange my job for 
any others. 

Mark one of the following to show 
HOW MUCH OF THE TIME you feel 
satisfied with your job. 

Vee АП of the time. 

6. Most of the time. 

. ___A good deal of the time. 
. _ About half of the time. 
8. ___ОссазїопаһПу. 

____ Seldom. 

Never: 


-Y 


Date 
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ARBITRARY WEIGHTS, AND ENTRIES CONSIDERED CORRECT FOR BISERIAL 
AND TETRACHORIC CORRELATIONS 
(MUNROE INSPECTION List CATEGORIES) 


PRODUCT MOMENT CORRELATION WEIGHTS 


Color: Movement FM:M M CF:FC FC 
Entries Weights Entries Weights Entries |Weights| Entries |Weights| Entries (Weights 
س‎ 1 +++ ї +++ 1 +++ 1 == 6 
—— 2 ++ 2 FF 2 T 2 == 5 
_ 3 + 3 + 3 EE: 3 o 4 
o 4 o 4 o 4 o 4 
t 5 = 5 = 5 = 5 
++ 6 -- 6 -- 6 
EPE 7 


ENTRIES CONSIDERED CORRECT FOR BISERIAL AND TETRACHORIC CORRELATIONS 


Category 


Entries Considered Correct 


Color : Movement 


CF:FC 


"Total Color 
Total Movement 
FK, Fc 


+++ 
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‚ Universal symbols do exist. Howeve 
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CHAPTER I 


PROBLEM 


A. BACKGROUND STUDIES 


"Г first experimental attack on the 
A problem of symbolism is credited by 
Sigmund Freud (3-а) to Scherner. In his 
attempt to establish the causes under- 
lying dream symbols, Scherner investi- 
gated the relationship between various 
stimuli and the dream-work. From his 
experiments he concluded that dream 
symbols are the product of physiological 
tensions, 

Freud (3-b) felt that Scherner had not 
Bone far enough into the problem. In 
working out the patterns of the dream 
process, Freud had observed that certain 
forms repeated themselves, with varia- 
tions, in different subjects. These forms 
seemed to have the same symbolic sig- 
nificance; although how the patient felt 
about them was indicated by other 
aspects of the dream-work, and varied 
from individual to individual. Stekel 
(21), Gutheil (8), and others later con- 
firmed Freud's findings. 

Psychoanalysts as а group have contributed 
much of the interest in, and most of the dis- 
cussion about, universal symbols in recent times. 
They differ in their viewpoints, however. Some 
analysts such as Jung (12), White (23), айй 
Barran (2), impressed by parallels between primi- 
tive myths and the dream-work of their patients, 
have concluded that there must be an “archaic 
unconscious,” common to all people, which con- 
sists of the inherited experiences of earliest man. 
Horney (10) took exception to this. She has held 
that cultural factors and variations in individual 
Xperience make it impossible to generalize on 
Symbolic meanings to the extent of assuming the 
existence of universal symbols. 

Others, like Ferenczi (5), and Symons (22), 


Seem to have formed the clinical impression that 
т, they have 
not found it necessary to resort to constructs like 


“ А Ars e imi- 
the archaic unconscious" to explain the sim 


larity between private symbol-systems, such as 
are found in dreams, and primitive myths. 
Neither have they found it necessary to ignore 
such a similarity because of chronologically sec- 
ondary, though clinically important, indivdual 
differences. They base their theory on the belief 
that the process of thought is similar in all 
people. 

Some non-analysts will not accept the concept 
of universal symbolism, but not so much because 
they are opposed to the concept as such. What 
they oppose is the methods used to establish uni- 
versal symbolism. Havelock Ellis has fought the 
subjective bias in psychoanalysis which, he said, ` 
was at the basis of the belief in universal sym- 
bolism. He thought that the analysts, unwittingly 
or by design, have communicated to patients 
their own conceptions of what certain symbols 
món. The possibility that this might be true 
raises certain questions as to the reliability, not 
to say validity, of various types of clinical 
findings. 

Anthropologists have been concerned with the 
problem of universality in human experience. 
The present tendency is to emphasize cultural 
differences. However, some workers have been 
even more impressed by apparently basic simi- 
larities. For example, Roheim (18) established 
that there are certain symbols which appear 
among all the Australian tribes he had studied. 
Hallowell (9) has found it possible to analyze an 
Indian’s dream in terms of Freudian symbols. 

Various concepts growing out of psychoanaly- 
sis have been checked experimentally by Sears 
(19) and others, but the concept of symbolism 
as such has never been studied by experimental 
methods. Interest in projective techniques re- 
cently has drawn attention to the problem anew. 
"The Rorschach test in particular, in its "qualita- 
tive" or symbolic-content aspects, carries the im- 
plication that universal symbols can be assumed 
to exist. Lindner (14) wrote about diagnostic 
Rorschach signs in terms of this implication. 
However, Lindner's observations also were in 
the nature of clinical findings rather than ex- 


perimental results. 

Among psychologists, Franck (7), assuming the 
correctness of universal symbolism, constructed a 
test of esthetic preference based on "masculine" 
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and "feminine" figures. Similarly Buck (4) in 
constructing his House—Tree—Person (H-T-P) 
test, has posited a certain universalism of re- 
sponse as he generalized on the cognitive aspects 
of his symbols. 

Alschuler and Hattwick (1) have made a com- 
prehensive study of symbolism in the paintings 
of young children. The method of these authors 
was completely inductive. When they compared 
the free drawings of children, they found evi- 
dence for assuming definite uniformities in the 
images of their subjects. Similar conclusions were 
reached by Immergluck and Cohen (11) who in- 
structed their subjects to react to emotionally 
charged words (e.g. mother, father, love, war) 
by chance designs produced on paper. They 
found striking similarities in rhythmic patterns 
produced in association with given word stimuli. 
This study is especially interesting because it was 
not the purpose of these investigators to establish 
the possibility of symbolism in any form. 

In summarizing the theoretical ma- 
terial in the literature pertaining to cal- 
tural symbolism, we may conclude that 
psychoanalysts in general have accepted 
the existence of universal symbols. They 
have found the concept useful in dream 
analysis and the interpretation of symp- 
toms in the course of therapy. Anthro- 
pologists have applied the same concept 
to the analysis of myths of different peo- 
ples. However, because psychoanalysts 
and anthropologists have used either 
clinical or deductive methods for the 
most part, psychologists as a rule have 
declined to accept their conclusions. Re- 
cent psychological studies, such as those 
of Franck, Buck, Alschuler and Hattwick, 
and Immergluck and Cohen, have either 
assumed or inferred symbolic values, but 
did not aim to establish them. 

This, substantially, was the state of the 
literature when the present study was 
begun in the winter of 1946. 


B. Basic Concepts 


This study is concerned with reactions 
to certain visual stimuli. These stimuli 


have two characteristics in 
First, they are line figures which might 
be called nonsense patterns, because they 
are not pictorial representations of any 
obvious object, person or human detail; 
second, they have been presumed to pos- 
sess a universally symbolic value. Because 
neither these nor any other line-patterns 
have been accepted as universal symbols 
by psychologists, this study raises the 
general question of whether universal 
symbols may be said to exist at all. 

Clarity of discussion demands a defi- 
nition of terms. Before details are given 
as to how this study was designed, we 
must make clear what we mean here by 
universal symbols. There are three points 
to consider: 1. What is a symbol? э. What 
implications does the concept of unt 
versality carry in this study? 3. How can 
supposed nonsense figures actually be 
universal symbols? 


common. 


First, the term symbol has been given a variety 
of meanings by various people, depending upon 
the application involved. There are as many 
definitions for the word "symbol" as there are 
authors writing on the subject, but the layman's 
meaning of “something that stands for some- 
thing else" is preserved in all of them. For our 
purposes, the definition given by C. W. Morris 
(16) is most acceptable in that it emphasizes the 
stimulus-value of a symbol. He defines the 
symbol as a "given (or experienced) substitute 
stimulus leading to a reinstatement of the 
original stimulus in a form observable only from 
the self-inclusive point of view.” This definition 
implies that all symbols affect behavior by first 
evoking previous experience. It also implies that 
the image of the original experience is always 
peculiar to the individual. 

A symbol which is effective for more than 
one person is based upon experience that has 
been shared by all for whom the symbol i$ 
effective. However, since any experience 5 
uniquely personal (in the psychobiological sense), 
the reaction to any symbol is also uniquely Ре! 
sonal. The Morris (16) definition holds for all 
symbols, whether verbal or non-verbal. Just how 
it applies may most easily be seen in a verbal 
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sample. When an English-speaking person is 
Biven the stimulus word, “Mothet,” he responds 
both in terms of: (a) the concept “Mother” as 
shared by all individuals (e.g, the quality of 
bearing children) and (b) the image of his own 
mother as it remains in his memory from his 
own experiences with her (e.g, enforcement of 
cleanliness, emphasis on religion, etc). In other 
words his response contains elements which are 
found in the responses of others and elements 
Which no one else's responses could contain. 
Second, to assume that a symbol possesses a 
general (as distinquished from a uniquely per- 
sonal) value, is to lay claim to two points: One, 
that there are experiences which are general, so 
that responses to certain symbols include rein- 
statements of original stimuli which are general 
for all people; and two, that these experiences 
may be symbolized by a form which is capable 
of reinstating the original (personal) experience. 
That generalized cultural (or reactive) experi- 
ences do exist among all peoples has been estab- 
lished by numerous anthropologists. A recent 
Study by Ford (6) has demonstrated universal 
behavior patterns in 25 preliterate groups. Some 
of the patterns for which he listed 100% uni- 
versality are eating, talking, coitus, etc. 
Perception of structural differences in human 
beings may be added as a fairly general experi- 
ence among people. Thus Seligman (20) has 
made the point that similar biological char- 
acteristics of human individuals are responsible 
for the universal features of certain culture pat- 
terns, An unanswered question is whether the 
form of symbolization is generalized enough to 


include even various individuals in a given cul- 
ture, let alone all people everywhere. This is an 
nay throw some 


issue on which the present st udy m 
light but on which it will not seek to offer final 
Proof, 

The final question that ari 
Seeming nonsense patterns coul 
alized symbolic value. The patterns 
obviously abstract, i.e., remote from 
human experience. When individuals respond to 
them nonetheless, it is surely not safe to assume 
that their response involves а conscious recogni- 
Поп of the patterns as symbols. It is possible, 
however, to assume that these abstract patterns 
are substitute (symbolic) stimuli which are 
Tecognized, though not on 2 level of clear 
understanding, as having а Vague relationship 
to the original stimulus. Such perception, it 
must be emphasized, is not on а level of 


ses here is how 


d have a gener- 
involved are 
concrete 


conscious awareness. In other words, the 
stimuli here operate without becoming under- 
standable in verbal terms. The term “uncon- 
scious” is sometimes used to describe responses to 
the stimuli in question; but because of semantic 
difficulties, this term is here avoided, and the his- 
torically more respectable term “subliminal” (15) 
is used instead. 

In summary, a generic or general 
symbol is defined as a substitute stimulus 
capable of reinstating a basic human ex- 
perience which is part of the shared ex- 
perience of many people in a given cul- 
ture at least. Such a symbol may be so far 
removed from phenomenal reality as we 
know it that it can operate only as а sub- 
liminal cue. The response to it will be 
general only in the sense that the symbol 
will be an effective stimulus for all people 
sharing certain basic experiences. How- 
ever, each person’s response to such a 
“generalized symbol," as indeed to less 
common stimuli, must be assumed to in- 
clude elements that are both unique and 


personal. 
C. EXPERIMENTAL DESIGN 

Eleven hypothetical symbols repre- 
senting as many basic human experiences 
were included in this study. These sym- 
bols were designated as stimulus patterns, 
and were presented to the subjects as 
undifferentiated stimulus material. 

The reactions of subjects to the stimu- 
lus patterns were obtained by the follow- 
ing method: The subjects were handed 

encils and eleven sheets on each of 
which appeared one of the stimulus pat- 
terns. The directions asked for drawings 
of anything they pleased to present on 
each sheet, using the lines already on the 
sheet, if they could (details in Chapter 
II, section on Presentation of Stimuli). 
Next, an inquiry was conducted to estab- 
lish what they had done and how they 
had felt about each of their drawings. 


In this way, not only the subjects’ draw- 
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ings, but their verbal associations with 
the stimulus material were obtained as 
well. In each case, the two kinds of re- 
actions, the drawings and the verbal 
associations, were considered the sub- 
ject's response. 

In this study it was assumed that the 
subjects' responses covered areas of ex- 
perience of which the stimulus-patterns 
were symbolic. In the light of this as- 
sumption it was expected that specific 
attitudes of the subjects growing out of 
these areas might be revealed in their 
responses. To establish or disprove this 
possibility, the interpretations of the sub- 
jects' responses were checked against 
material gathered on these same subjects 
by independent workers. The material 
used for validation was referred to as 
validation data. ^ 

In brief, then, the design of this study 
involved the assumption that, if a given 
stimulus possesses a generalized symbolic 
value, it should be possible to infer spe- 
сїйс facts about a subject from his re- 
sponse to that stimulus. These facts 
would relate to the basic human experi- 
ence symbolized, in terms of its meaning 
for the subject. Conversely, if it is possi- 
ble to discover accurate and specific 
facts about a subject from his reactions 
to a stimulus with a hypothetically 
generalized symbolic value, the repeti- 
tion of this procedure with many subjects 
should substantiate the hypothetical 
meaning of the stimulus. 


D. SELECTION OF STIMULUS PATTERNS 


Eleven basic experiences were selected 
as probably common to all human per- 
sonalities in our culture, and possibly 
other cultures as well.! Visual forms were 
then adapted from various sources (See 


'Copyright of patterns symbolizing these ex- 
periences, as presented in this section, is reserved. 


Background Studies) as symbolic of each 
area of experience. Thus eleven visual 
forms came to be used as stimulus-pat- 
terns in this study. Responses to the stim- 
ulus-patterns were considered to have 
been determined by the subjects’ atti- 
tudes toward the “reinstatement” of the 
“original stimulus” (the basic experience) 
evoked by its supposed substitute-stimu- 
lus (the hypothetically symbolic stimulus 
pattern). 

All of the stimulus-patterns were pre- 
sented in as slightly structuralized form 
as possible,? so as to provide maximum 
opportunity for modification by the sub- 
ject. The following is a list of the sym- 
bols selected and the sources from which 
they were adapted. It is considered be- 
yond the scope of this study to establish 
why these particular symbols relate to the 
areas symbolized. However, theoretical 
reasons are suggested. 


1. The semi-circle pattern. (Fig. 1). 10 seems 
to be a safe assumption that any human society 


Fic. 1, The semi-circle pattern. 


provides its members with the experience of 
“femaleness.” This implies only that every indi- 
vidual learns to differentiate between the two 
sexes. How a given individual interprets “fe” 
maleness" depends upon his personal experience 
with females. As stated in the discussion of 


*The actual order of presentation was: (Q) 
blank sheet, (2) semi-circle, (3) semi-circle ал 
single line, (4) single straight line, (5) invertec 
"V," (6) diffuse mass, (7) crosslines, (8) counter 
opposed semi-circle, (9) semi-circle within. large? 
semi-circle, (10) jagged line, (11) incomplete “U, 
(12) lines and semi-circles. 
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"Basic Concepts,” the image 4voked by any 
generalization is necessarily peculiar and private 
to the subject. It is probably true that similari- 
tics between images of people from a given 
culture are greater than those between people 
from different cultures. The hypothesis to be 
tested by the first stimulus-pattern is that it is 
symbolic of “femaleness,” or refers to aspects of 
"femaleness" which are fundamental to any such 
generalization. 

The symbol for this basic experience origi- 
nated with Freud (3-с, р. 378) . It was his clinical 
observations that led him to assume that round- 


Fic. 2. The 


Dess is symbolically associated with the concept 


Of “femaleness.” Alschuler and Hattwick (1), in 
their study of children’s drawings have found 
that both rounded forms and square ones were 
characteristic of children when they were be- 
having along lines of more “feminine” patterns. 
Also when their children reported negative feel- 
ings about their mothers, they tended to smear 
9r otherwise destroy rounded forms in their 
drawings. 
2 Franck (7) based her work on the concept that 
feminine” design was composed of round forms. 
Similarly, Immergluck and Cohen (11), in classi- 
lying drawing patterns obtained in response tO 
Verbal stimuli, found unexpectedly that round 


Counter-opposed semi-circle 


lines tended to be associated with the concepts 
“mother” and “love.” 

In the light of such findings, the author has 
chosen the semi-circle, as a rounded form, to rep- 
resent the most likely symbol for “femaleness.” 
The drawing responses associated with this form 
were expected to reflect the subject's attitudes 
toward females, within the orbit of his or her 
personal experience. А 

s. The counter-opposed semi-circle pattern. 
(Fig. 2). One semi-circle has been assumed to 
mobilize attitudes toward the quality we desig- 
nate as “femaleness.” Going one step further, 


pattern 


we could assume that this same concept could be 
utilized to test attitudes toward females gen- 
eralized as "female society." For this reason, it 
seemed desirable to devise a field containing 
more than опе semi-circle. 

The pattern in question derives from a basic 
rationale which is thus similar to that of the 
first pattern, except for the aspect of pluralism 
which it is meant to convey. To avoid the impli- 
cation of two breasts, which might refer to one 
woman, the two semi-circles were given a go? 
tilt. In thus counter-opposing two symbols of the 
same independent significance, the pattern was 
ably insured against perception in its 


presum 
instead of in its plural, form. 


singular, 
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Fic. 3. The semi-circle within a larger semi-circle pattern, 


If the assumption made is valid, then a female 
subject, identifying with one of the semi-circles, 
might reveal her attitude toward another 
woman; while a male subject, unable to find 
ground for identification in this figure, might 
show his attitude toward more than one woman. 

3. The semi-circle within a larger semi-circle 
pattern. (Fig. 3). Adjustment to the increased re- 
sponsibility which growing older engenders at 
every stage, may be considered a broadly human 
area of experience. The desire to regress, meas- 
ured by the strength of dependence on the 
mother, and attitudes toward siblings, are all 
related to this area. 

Roheim (18, p. 102) discusses what he calls 
the “concentric circle pattern" in relation to the 
fertility rites of Australian primitives. His treat- 
ment of the subject suggests that this might 
be a universal symbol for the "child and mother" 
area of experience. Freud (3, p. 375) of course 
made a good deal of this kind of symbolization. 
Alschuler and Hattwick (1) have found that 
rounded figures, or square ones, drawn with 
other geometrical forms placed inside, were 
characteristic of children's drawings shortly prc- 
ceding the birth of younger siblings, and per- 
sisted for a time after their birth. These con- 


Fic. 4. The single straight line pattern: 


ç _ 
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clusions support the Freudian hypothesis that 
forms-within-other-forms are symbols of intra- 
uterine existence or maternal protection. Al- 
schuler and Hattwick have named the pattern 
“the container configuration.” 

The relationship to both mother and siblings 
was expected to emerge from the drawings based 
on this pattern, 

4. The single straight line pattern. (Fig. 4). 
"Maleness" is considered to be another basic 
human experience. The reasoning in selecting 
"maleness" as one of the commonplace areas of 
experience is the same as that used in the selec- 
tion of "femaleness." 

Freud (3, p. 368), generalizing on the shape of 
the male sex organ, is responsible for the 
generalization that, in dreams, oblong objects 
symbolize the quality of “maleness.” Buck (4) 
in a cautious statement, has also intimated 
that, in his H-T-P test, a tree might carry 
phallic significance. Alschuler and Hattwick (1), 
in their study, have more fully confirmed Freud's 
generalization by showing that straight lines о! 
elongated figures are connected with aggressive 
"masculine children." When 


1 Hattwick found, chil- 
draw them 


moods of the morc 
frustrated, Alschuler and 
dren cross out the straight lines, or 
horizontally instead of vertically. In Franck's (7) 
study, "masculine design" consisted of clongated 
forms. She found that, on the basis of round 


Fic. 5. The crossline pattern. 


Fic. 6. The semi-circle and single line pattern. 


forms or straight lines, she could differentiate 
those who preferred the "feminine" from those 
who preferred the "masculine." Finally, Immer- 
gluck and Cohen (11) have found a definite rela- 
tionship between stimulus words such as “father” 
and vertical lines. 

The straight line was thus selected as the 
symbol for "maleness." The drawing responses 
based on this pattern were expected to reflect 
the subject's atitudes toward males and the 
nature of his or her experience with males, 

z. The crossline pattern. (Fig. 5). "Male so- 

ciety" must be considered as basic a human ex- 
perience as is "female society." The same logic as 
was used in the selection. of "female society" as 
an area of common experience was hence applied 
the symbolization of "male society" also. 
]t was considered that, if one line represents 
“maleness,” two crossing lines might be assumed 
to represent some form of interaction between 
males. The pattern here evades the possibility 
of association with such consciously perceived 
cultural patterns as a cross or the letter “ж 

If the basic hypothesis is correct, this pattern 
e drawings having refer- 
to another man and, in 


to 


was expected to stimulat 
ce, in a male subject, 


en Š 
to men in general. 


female subjects, 
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Fic. 7. The lines and semi-circles pattern. 
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6. The semi-circle and single, line pattern. 
(Fig. 6). Interaction between the two sexes is as 
universal as the existence of both sexes in any 
society. 

In order to symbolize such interaction, a com- 
bination could be made of the previously dis- 
cussed symbols for "maleness" and "femaleness." 
The combination as given in this stimulus- 
pattern should imply relationships with members 
of the opposite sex, if the meanings ascribed to 
these figures individually ате as claimed. 

7. The lines and semi-circle pattern. (Fig. 7) 
The attitudes expressed in any intimate hetero- 
sexual situation are not always the same as those 
shown in a competitive heterosexual situation. 
However, the latter may also be assumed to bea 
common area of experience. 

The field, in this case, contains several hypo- 
thetical symbols for "maleness" and “female- 


Fic. 


8. The incomplet 


ness." It seemed possible that this design might, 
in both sexes, evoke reactions to a competitive 
heterosexual situation. 

8. The incomplete “U” pattern. (Fig. 8). An 
attitude toward the self is another universal 
among human beings living in groups. A stimu- 
lus-pattern which offers an opportunity for the 
projection of the self-concept must be a figure 
which is not like the hypothetical symbol for 
“maleness” nor like the one for “femaleness.” 

The incomplete "U" pattern fulfills this de- 
scription because it is composed of elongated 
lines which, at the same time, are somewhat 
curved. This enables the subject to project 
either pattern with equal ease. The "U" shape 
seemed to be such a pattern. Furthermore, this 
figure possesses the quality of inclusiveness 
which might provide a subliminal cue to the 


subject in answer to the question: "What is con- 


e “U” pattern. 
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Fic. 9. The diffuse mass pattern. 


tained therein?" Thus, in terms of projection 
theory, as applied in current techniques (sec 
section on Projective Techniques, last chapter), 
the “U” pattern might elicit a response to the 
question "What is contained within you?" The 
subject can respond to this question apparently 
only in terms of himself. 

The figure used here is a "U"—left open at 
the bottom to permit greater freedom for the 
subject. 

9. The diffuse mass pattern. (Fig. 9). It is 
assumed that all people at times meet situations 
which cause uncertainty of feeling, or present a 
threat of some kind. This element of uncertainty 
is further assumed to carry with it negative 


affect. The writer considers this to be what is 
meant by anxiety. In this sense, anxiety-provok- 
ing situations are universal. 

Rorschach discovered a relationship between 
diffuse, grey, formless stimuli on the one hand 
and feelings of anxiety on the other. The 
diffuse mass pattern was hence used here as » 
stimulus representing ап anxiety-provoking situa- 
tion. 

The assumption was not that everyone suffers 
from a "state of general anxiety." It is the way 
in which the subject copes with anxiety-arous” 
ing stimuli that was expected to be ascertain 
by the response to pattern 0. 

10. The inverted “V” pattern, (Fig. 10). Bee 
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Fic, 10, The inverted “V” pattern. 


cause of the importance of sex-role patterning in 


any society, it was assumed that there are à 
xual signif- 


sufficient number of situations with se 
icance (causing uncertainty of reaction and/or 
evoking anxiety) for sexual anxiety to be consid- 
ered a special area of human experience. 


The pattern was suggested by the analysis of 
ts, In. their dreams, 


dreams of psychiatric patien 
the inverted "V" pattern was related to sexual 
anxiety. This pattern was therefore included 
in the experiment to test. the commonness of 
this association. Perhaps it is the formlessness 
of the pattern that is responsible for its anxiety- 
provoking function. Its discrete clements, which 
may be regarded аз either masculine (angular 
points) or feminine (hollow figures), would prob- 
ably be responsible for any sexual connotation 
it might have. 
As in the case of general anxiety, the inter- 
pretation here was in terms of the individual's 
manner of coping with such situations as tend 
to provoke specifically sexual anxiety. w. 
11. The jagged-line pattern. (Fig. 11). Hostility 
and aggression may be assumed to bea common 
experience of human individuals in OUT society, 
and in a multitude of societies other than our 
own. 


The study by Immergluck and Cohen (11) 
confirms the plausibility of the assumption that 
the jagged-line figure evokes responses of aggres- 
sion. These investigators have reported that 
jagged lines were frequently associated by their 
subjects with such words as "war" and “aggres- 
sion." Dream material from psychiatric patients 
also confirms this assumption. The association 
between à jagged line and the concepts of speed 
or force might be responsible for the tendency 
of this pattern to disclose hostile or aggressive 


impulses. 


Fic. 11. The jagged line pattern. 


CHAPTER II 


PROCEDURE 


A. PRESENTATION OF STIMULI 


HE first step in approaching the sub- 
I. was to make it clear to him that 
what he was about to do was in no sense 
a test of his art ability. When pressed 
for an explanation, we have occasionally 
told subjects that the procedure was a 
"kind of test of ingenuity." However, 
almost always subjects were content with 
the instructions as given. The usual state- 
ment was as follows: 


You will notice that the first sheet here has 
nothing on it! You may draw anything at all with 
these pencils. There will Ье no time limit; take 
аз long as you wish. When you have finished the 
first drawing, turn the first sheet and take the 
second sheet by this corner (demonstrating) and 
flip it over. [This procedure was to accomplis 
two things: one, to prevent the subject from 
seeing the stimulus-pattern before he was ready 
to draw upon it; and, two, to make certain that 
the pattern was seen in the same position by all 
subjects.] On the rest of the sheets, you will notice 
there are some lines already present. You may 
again draw anything that suits your fancy, but 
on these sheets try to include the lines, if pos- 
sible. 


The order of presentation of the stim- 
ulus material in two-thirds of the cases 
avoided placing a stimulus-pattern im- 
mediately after one with a related hypo- 
thetical meaning. For example, the 
"maleness" pattern (Fig. 4) was separated 
from the "male group" (Fig. 5) pattern. 
This was to prevent carry-over of ideas 
which might make it difficult to establish 
the stimulus-pattern to which the sub- 
ject was reacting, The one purposeful 


*In the presentation of the stimulus-patterns, 
each subject was given a blank sheet of paper 
on which to make a “free drawing.” The pur- 
pose of this was to offer an Opportunity for 
expression of attitudes which were brought into 


the experimental situation. Analysis of these} ' 
drawings is not necessary to test the hypothesis of, ‘ 
universal symbolism. ny 
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exception was the placing of the sexual 
anxiety stimulus before the one for gen- 
eral anxiety. It was feared that otherwise, 
especially in our society, a subject might 
react to the general anxiety stimulus in 
sexual terms, when he could react in 
other terms if first given a chance to 
"drain off" sexual anxiety. There re- 
mains a question of whether placing the 
sexual anxiety stimulus first might not 
arouse feelings which would then arti- 
ficially influence performance on the 
general anxiety pattern. The writer be- 
lieves that the answer lies in the release 
of tension provided by an expression of 
attitude or “catharsis.” Since the subject 
was not merely expected to respond to 
certain stimulus-patterns (as in Ror- 
schach or TAT) but given an opportun- 
ity for individualized expression by even 
changing the stimuli, it was felt that a 
certain order of presentation of the stim- 
ulus-patterns was indicated. To justify 
this order, and as a control, random 
order was used in the last third of the 
cases, 

When the drawings were completed, a 
sheet of paper bearing the inquiry ques- 
tions was presented with enough paper 
for the subjects to answer each question 
fully for each symbol. The questions 
read as follows: 

Question A. What is this a picture of? 


Question B. Was it relatively easy or hard to 
draw? 

Question C. Do you like it? 

Question D. What mood does it represent? 

Question E. (1) Describe in detail the person- 
alities of any people (or animals) depicted. (2) 
What are they doing? (3) What are they think- 
ing and feeling? 

Question F. What mood do you feel looking 
at it? 


Question С. What does it remind you of? 
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The average amount of 4jme taken for 
the entire procedure by each subject was 
one hour. 

When it was possible to do so, notes on 
the manner in which the subject worked 
were appended to cach record. For ex 
ample, when the subject became sud- 
denly tired while working on one symbol 
and expressed the fact to the test super- 
Visor, the matter was noted, particularly 
in terms of the symbol upon which the 
Subject was working at the time. For 


most of the cases used in this study, of 
such 


course, it was not possible to make 
Ities 


observations because of the difficu 
encountered in group administration. 

A When the drawings and answers to the 
Inquiry questions had been collected, 
the next step was to analyze them in 
terms of the hypothetical meanings of 
the stimulus-patterns. "The section follow- 
ans describes in detail what this process 
involved, 


B. Anatysis OF STIMULUS MATERIAL 


Three major sources of data were avail- 
able for the interpretation of these stimu- 
lus-patterns, The first was the manner of 
drawing; the second, the content of the 
drawi ng; and the third, the verbal associ- 
ations as revealed by the subject in state- 
ments on the inquiry. 
stimulus sheets 


is in the analy- 
itself. Valuable 


чы manner of drawing on the 
sis ^r. given systematic emphas 
inforr ee single drawing by f 
.ormation about a total personality may be 
дав by the application of free-drawing- 
Ram techniques to the manner of drawing. 
is study, however, 2 complete understand- 


: j ү; 

me of the personality of each subject was por 

quired. For this reason, the manner of drawing 
ed to be out- 


Pun analyzed only when there seem ecd 

th nding differences between one drawing E 

МЯ Test of the subject's set of responses. 3 

санта атап matters as the use O° & vs 

dies of a particular drawing Ju 
1 to the quality of drawing ОП the о 


sheets, and the general Gestalt .of the drawing, 
ie, whether there had been closure of the figure 
and whether the whole sheet was used, etc., were 
taken into account. In this context, such factors 
as these helped clarify the subject’s responses to 
the individual stimulus-patterns. 

The content of a drawing is often obviously 
related to the hypothetical symbolic meaning of 
the stimulus-pattern. Thus, for example, one 
subject drew a baby-in-bunting out of the con- 
centric semicircle pattern, and labelled it “A 
baby which unfortunately looks like an adult.” 
The implication here is that the subject ac- 
knowledges mother-dependence and yet accepts 
status as an adult. The inquiry, of course, was 
essence. But even where no inquiry was 
conducted (as in the group of normal children), 
the subjects often spontaneously labelled their 
drawings. When they did not do so, the content 
was sometimes so obscure that it was impossible 
to tell what it was, or what it might mean to 
the subject. At times, of course, it may be possible 
to make some interpretation about the person- 
ality from the content of the drawing alone, 
even when the content is not quite as obvious as 
in the example cited at the beginning of this 
paragraph. For instance, when a drawing is 
labelled, but no further inquiry is available, 
the following might still be interpretable at the 
risk of subjectivity. For example, lollypops might 
be presumed , to imply positive affect; targets 
might be said to imply goal-direction; moons 
might be presumed to connote remoteness; etc. 
Such interpretations, however, offered without 
subject's verbalizations, could never be made 


of the 


the 
without misgivings. 

Answers to the inquiry questions provided 
verbal associative material. Consequently, when 
answers were available, as they were for most 
cases in the study, they made interpretation de- 
fensible. The answers to the questions as to 
what the drawing Was, whether it was easy or 
difficult to draw, and whether or not the subject 
liked it, details about the "personalities" de- 
picted, and any other material which the subject 
might associate with his drawing, were straight- 
forwardly given by most subjects. The replies to 
the questions on liking the picture and ease 
of drawing it were always usable for further 
interpretation. Various other replies could be 
analyzed in terms of the subject's liking or dis- 
liking of the picture and his ability to draw 
easily when presented with a particular pattern, 
The possibility suggested itself here that a 
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subject might like a particular stimulus-pattern, 
but still not draw well upon it, nor like his own 
product. When this happened, subjects were 
likely to specify that they disliked their execu- 
tion of the pictures, but not the concepts they 
had in mind. The subjects themselves thus 
helped in the interpretation of the inquiry. 
The two questions about "mood" (mood of 
the picture and mood it evoked in the subject) 
provided further information about how the 
subjects evaluated their own responses to the 
stimulus-patterns. It is interesting to note that 
the responses to the two questions about mood 
were usually not the same, although they were 
most often complementary. An example of this 
is the statement that the mood of a given picture 
was "horror," while the feeling evoked in the 
subject was one of "considerable discomfort." 
When the subject is in conflict about the symbol- 
ized area, the various inquiry questions usually 
stimulate different affectual tones in the various 
replies. For example, the mood of a given pic- 
ture could be "horror," while the feeling evoked 
in the subject might be described as one of 
"fascination." The answers to the questions 
about liking the picture and the ease with which 
it was drawn are basic to the rest of the analysis. 


The method of analysis must also be 
understood in terms of the differences of 
approach required by the different stimu- 
lus-patterns. Following is a discussion of 
the analysis in terms of each stimulus- 
pattern. 


The semi-circle pattern. (Fig. 1) has been as- 
sumed by other investigators to evoke responses 
to “femaleness.” In order to make specific in- 
terpretation of a subject's responses to the 
symbol of "femaleness," it is, however, necessary 
to establish a relationship between the concept 
and certain areas of the person's developmental 
history. The assumption was that responses to 
the concept of “femaleness” could be proved to 
be based upon contact with members of the 
female sex. It was further assumed that the 
subject's concept of “femaleness” could be shown 
to be influenced by his or her earliest or strong- 
est expcriences with a member of the female sex. 
Since, for most people, this member would be 
the mother and/or mother-surrogate, the in- 
terpretations based upon the drawings on the 
semi-circle pattern were taken to relate to the 
mother and/or mother-surrogate. 


Similar logic applies to the straight-line pat- 
tern (Fig. 4). This pattern is related to “male- 
ness.” Here, as in the case of the symbol for 
"femaleness" (Fig. 1), it was assumed that the 
subject's modification of the symbol might be 
in terms of his generalizations from the male 
figures who had made the strongest impression 
upon him. In most cases, this would be the 
father and/or father-surrogate. 

In connection with the figures for “malencss” 
(Fig. 4) and for "femaleness" (Fig. 1), there are 
two points of theoretical interest. The first is the 
extent to which identification on the part of 
the subject with either parental figure might 
influence his drawing. The self-concept might 
thus enter to a degree, In other words, the sub- 
ject might identify with the one or the other of 
his parents. Then, in reacting to the appropriate 
stimulus-pattern, he might indicate character- 
istics which are true of himself, but not of the 
parent. When this happens, however, he still 
reveals his concept of “maleness” or “femaleness,” 
as the case might be. For example, a boy who 
had identified with his father drew a "scale of 
justice" out of the straight line. On the inquiry, 
he stated that it was easy to draw, that he liked 
it, that its mood was “idealism,” evoking in him 
а mood of "determination." Finally, it reminded 
him of the “frailties of man." ` 

The interpretation was that there was a pos- 
itive relationship between the subject and his 
father, and that the subject. somehow derived 
from his father his ideals and ambitions. From 
the last association on the inquiry, howevcr, 
it seemed that the father probably had shown 
some significant weakness which was important 
to the boy. 

Case history material brought out the follow- 
ing facts: When the boy was a year old, his 
father was dismissed from the police force for 
bribery. The relationship between father and 
son was a good one, and the boy adopted his 
father's walk and manner of speech. At the time 
of testing, when the boy was seventeen years old, 
he expressed interest in law with the idea of 
regaining honor for his family. He was already 
well known in his comunity for his honesty: 
forthrightness, and high ideals of citizenship- 
Thus it may be seen that a subject may show 
clear identification with the parent and on the 
appropriate symbol include his own traits as well 
as information about the parent. 

The second point of theoretical interest COP 
cerns those cases where the subject drew male 
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figures on the semi-circle (Fig. 1) and female 
figures on the straight line (rig. 4). The as- 
sumption made here was that the subject was 
still expressing his concept of "maleness" or 
"femaleness," but that he viewed these in terms 
of reversal of the usual roles. Thus the drawing 
of a man's face on the semi-circle has been shown 
in various cases to be related to experience with 
a domineering mother. 

At first the writer considered the possibility 
that a person might respond to the "femaleness" 
pattern (Fig. 1) in terms of a male figure because 
of association with weak males. Similarly, it did 
not appear certain that a female figure drawn 
on the straight line referred to weak males rather 
than to strong females within the realm of the 
subject's experience. 

The inquiry material resolved this dilemma. 
It was usually clear from the answers exactly 
which parental figure was involved. For cxample, 
опе subject drew à grim-looking bald-headed 
man out of the semi-circle (Fig. 1). The straight 
line became part of a hotel described as being 


in total darkness because all of "the lights are 
one is over at the tavern across 


the bald-headed man was 
ho knows his stuff,” 


out, and every 
the street.” Further, 
called a “business man W 
and while the mood evoked in the subject was 
one of “admiration,” yet he stated that he did 
not like the picture. On the other hand, the 
Mood evoked in the subject by the straight-line 
drawing was that of “longing,” and he expressed 
a liking for the picture. 

The case history showed that the mother was 
а strong, determined woman who managed the 


family budget. The father was characterized 


as a relatively weak man who worked as a travel- 
not controlled 


ling salesman, and tended, when 
by his wife, to seek satisfaction in drink. The 
Subject was described as being more fond of his 
father than of his mother. However, the father's 
Occupation prevented close companionship with 
the subject. 
This kind of check, bolstered by numerous 
examples, lent strong support to the hypothesis 
(see case analyses and validation data) that me 
semi-circle (Fig. 1) is related to “femaleness. 
The crossline pattern (Fig. 5) and the counter- 
онеш semi-circle pattern (Fig: 3), —— 
Y both represent plural situations, WOU 
Seem to focus the subject's attention upon spl 
Aspects of his attitudes toward «maleness” and 
; o maleness,” rather than his attitudes toara 
Ndividual figures who might be prototypes ° 


his concepts of “maleness” and "femaleness." 
Thus, the interpretations made (see validation 
material and case analyses following) on the 
basis of the drawings evoked by these two 
stimulus-patterns relate to the subject's attitudes 
toward the male and female members of his 
group, as he feels toward them “en masse,” so to 
speak. The kind of social technique which an 
individual may use often varies with the sex 
with which he is dealing. This kind of difference 
stands out clearly in the drawing material and, 
when it occurs, note is taken of it in the analysis. 
For example, on the counter-opposed semi-circle 
pattern (Fig. 2), one subject drew a line connect- 
ing the two arcs at the top. This he called a 
table. The crossline pattern (Fig. 5) he turned 
into a pair of sabres held by two opponents. 
From the content of the drawing and the 
inquiry material, it was concluded that the sub- 
ject considered himself to be a source of support 
to women in the “strong silent” manner. The 
manner of drawing was of help in making this 
interpretation also, because it has been observed 
tKàt use of the space between the areas seems 
to be characteristic of those who wish to estab- 
lish that they are not identified with the female 
group. The interpretation on the second drawing 
was that the subject was generally antagonistic 
to those of his own sex, Case history data, and 
the information given by the subject's psychia- 
trist for validation purposes, confirmed these 
interpretations. 

There are two stimulus-patterns that are com- 
binations of the male and female symbols. The 
first one to be considered here consists of оле 
semi-circle and one straight line (Fig. 6). Being 
combination of the male and female symbols, 
this pattern, in the relationship represented, 
appeared as if it might have relevancy to the 
intimate heterosexual situation (see case analyses 
and validation data). The kind of specification 
on this symbol that a given subject makes de- 
pends presumably upon the range of his ex- 
perience and often on his age. The subject may 
or may not relate this stimulus-pattern to ex- 
periences of physical contact. The subject's 
interpretation tends to be in terms of his atti- 
tude toward a close relationship between two 
one of each sex. 

The second combination consists of an ar- 
rangement of more than one of each sex-symbol 
in the field (Fig. 7)- This arrangement is such as 
est competition between members of the 


to Sugg: a 
same sex for those of the opposite sex (see tabular 


a с 


people, 
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material on this pattern). Again, the degree to 
which sexuality, in the sense of physical contact, 
is involved depends on the subject. 

An example of how the attitude toward the 
intimate heterosexual situation may vary from 
the one expressed for the competitive hetero- 
sexual situation is the following: A subject drew 
on the intimate heterosexual stimulus-pattern 
what he called "a balloon drifting lazily toward 
the earth." On the competitive heterosexual 
stimulus-pattern (Fig. 7) he drew a "dart game 
in which each guy sees who can get more." The 
interpretation was that the subject felt stimu- 
lated by competition with other men for women, 
but his aggressiveness did not extend to the 
intimate level, where he developed a more pas- 
sive attitude. This analysis was in accord with 
the subject's own appraisal of his behavior. Of 
course, other subjects show the opposite attitude, 
withdrawing from the competitive heterosexual 
situation, but becoming assertive when secure in 
an intimate heterosexual situation. Most often, 
the attitudes expressed on the two stimulus- 
patterns are. complementary. nt 

The incomplete *U" pattern (Fig. 8) permits of 
ready projection of a face, but also of a different 
use, if the subject is not inclined to draw a face. 
This seems to offer an opportunity for the ex- 
pression of attitudes relating peculiarly to the 
self. Again, the method of modifying the symbol 
is taken into account in the analysis. However, 
particular stress is laid upon the content of the 
drawing in making interpretations as to the self- 
concept. 

The most frequent response is the drawing of 
а vessel of some sort, usually a glass or vase. 
Such containers are passive receptacles, and 
imply an attitude of passivity on the part of 
the subject. At least, we have so interpreted 
them in our analyses (check against tabular 
material on the pattern involved). A striking 
example was the drawing of one malc subject 
who labelled the main figure a pot, drew a fire 
underneath, and stated that it was the “соттоп 
pot for all the bums who keep taking from it." 
The case history shows that the subject was a 
member of a large family in which he was known 
as the "good one." He was considered to be the 
person to ask for any favors. Particularly inter- 
esting was the fact that each member of the 
family said that he was shamefully imposed upon 
by all the others, but no one admitted imposing 
upon the subject. Living with his younger 
brother in college, he did the housework and 


cooking for both; and also lent his brother much 
money from his*bwn government allowance, even 
though his brother was receiving the me 
monthly amount. At the time the drawings Were 
administered, the subject was secking psychiatric 
aid in adjustment to marriage. He felt that 
advantage had been taken of him in arranging 
the marriage, and he felt that he "suddenly 
couldn't take it any more." These facts would 
seem to confirm the interpretation of his draw- 
ing, which was that the subject felt himself 
to have played a passive role in which others 
had taken advantage of him. Not all subjects 
who draw passive figures on this stimulus-pattern 
indicate conflict about it, of course. Also, draw- 
ings other than those of vessels may indicate 
a passive element. В 

Often faces are drawn, and in the inquiry 
they may bring forth any emotion in the human 
range. The guiding principle in interpreting 
reactions to the incomplete-U pattern is that 
the self-concept alone is revealed. If an indi- 
vidual has misconceptions about himself, they 
will be in evidence. Thus, a strikingly beautiful 
girl with an LQ. of 152 (Wechsler-Bellevuc) 
and various college honors, including Phi Beta 
Kappa, said on the inquiry about the face she 
had drawn on the incomplete-U pattern (Fig. 
8): "She is silly, shallow, vain, and quite un- 
worthy—not even really pretty." When asked 
for her real opinion of herself during the inter- 
view this subject spent eight minutes telling 
why she was not actually as personable as most 
people seemed to believe, Naturally, miscon- 
ceptions about the self can be in the direction 
of self-flattery, also. In any case, it is how the 
subject regards himself that is sought in the 
validating data for the incomplete-U pattern 
interpretations, 

In the symbols for sexual anxiety (Fig. 10), 
general anxiety (Fig. 0), and aggression (Fig: 
11), the analysis was focused on two possibilities: 
First, and most important, the manner of han- 
dling, i.e., the degree of structuralization added 
by the subject, and the way in which this was 
accomplished. By way of example, we might take 
the subjects who do not draw anything on these 
stimulus-patterns. Failure to use this pattern 
might mean that, when confronted by an anxiety- 
provoking situation, the subjects could not focus 
9n them even though told to try to do 50: 
The analysis, then, should be that the subject 
could not deal effectively with the type of situa- 
tion symbolized. 
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In other cases, the subjects made a stimulus- 
cem an integral part of a ^nore complete 
q wing: Thus, the aggression-stimulus (Fig. 
11), might be used as the side of a building. In 
such an instance, the analysis would be that 
the subject reacts to aggression situations con- 
сер аѕ much because of the constructive 
the stimulus-pattern as through the 
content of the drawing. 
qu s MET aspect of the analysis referred to 
peg Br the drawings made on these three 
might о It was felt that the content 
tiled eal the kind of experience the subject 
to relate to the stimulus-pattern. Thus, 
А subject who called the general anxiety 
stimulus-pattern “dirty combings” was considered 
ч be concerned with matters of cleanliness. On 
M E inventory form, this subject indeed 
ated: "Cleanliness is next to godliness.” She 
Was under treatment for compulsive symptoms, 
including compulsive washing. 
The semi-circle within a 
Pattern (Fig. 3), or regression symbol, relates to 
the subject's attitudes toward the mother. How- 
Сует, it is especially important in revealing the 
subject's security strivings as contrasted to а 
Sense of emotional independence. All interpreta- 
tions contain reference to this point. The 
manner in which the subject handles the stimu- 
lus-pattern is frequently important in analyzing 
responses to this stimulus-pattern. For example, 
it is possible to “close” the figure in different 
ways. One way is to complete both circles (€g» 
automobile tires). Regardless of the implications 
of the content, completion of the circles in this 
manner could be interpreted to mean a de- 
pendency (regressive) reaction. The logic here 
is that the structuralization involved tends to 
emphasize the circle-within-a-circle pattern. Some 
Subjects connect the parallel ends of the two 
Circles (eg., a "bitten-off doughnut"), emphasiz- 
ing emptiness. Here interpretation must take 
into account the fact that the subject has given 
Up expecting maternal protection. Emotionally 
independent, he feels free to manifest aggressive 
behavior, This kind of analysis has been sub- 
Stantiated by the validating material. 
The importance of the content is evident in 
this pattern also, The inquiry material often 
ti light on sibling relationships a$ well as 
тў l attitude toward mother dependency, Со 
ш. EE eight made а "fat Cat Б. eon 
ned in. Bede де нани не а Де 
by a sudden deflation. The interpre 


larger semi-circle 


was that he had been relatively secure and 
dependent in attitude, but that a recent shift 
in interpersonal relationships (probably with 
his mother) had suddenly left him feeling in- 
secure. Case history material showed. that his 
first younger sibling had been born three months 
prior to the time of drawing. As the youngest 
member of the family until this happened, the 
subject had been the "family pet," and used to 
spend most of his free time trying to help his 
mother with her housework. He was described 
as bewildered by the sudden loss of constant 
maternal attention. These facts support the 
interpretation of his drawing. т 


Further examples of the method of 
analysis are given in two complete cases 
included in a later chapter. 


С. VALIDATION DATA FOR EACH 
SUBJECT GROUP 


о 
This study comprises data on 169 sub- 


jects. Of these, 157 are white Americans 


and 12 аге American Indians. They are 
almost evenly divided as to sex, and 
range in age from six to sixty-nine. 

In all cases, blind analyses of the draw- 
ings were made first. Then, specific inter- 
pretations were checked against validat- 
ing data. The sources of validation 
varied with the group, and are com- 
pletely described in this section. The 
same rating system was used in all 
groups, however, to express degree of 
agreement between the analyses and the 
validating data. This system is fully de- 
scribed in the following sections. It pro- 
vided a checking system in terms of the 
following possibilities: Complete Сот- 
roboration, Partial Corroboration, Con- 
flicting Evidence, Partial Conflicting Evi- 
dence, Complete Contradiction, Partial 
Contradiction, and No Evidence. 

The subject groups and data available 
for checking the validity of drawing 
interpretations follow: 
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Group No. 1: NORMAL CHILDREN 

Description: This was a group of 30 
public school children, evenly divided as 
to sex, and all approximately 19 years 
old, considered a part of a "normal" 
population. 

Method of Validation: This group has 
been the object of concentrated study as 
part of a program of cooperative effort 
by psychologists, social. anthropologists, 
sociologists, and other members of the 
Committee on Human Development at 
the University of Chicago. The various 
workers, operating independently, em- 
ployed different techniques of study. The 
kinds of materials which were utilized 
and analyzed independently by the vari- 
ous members of the conference team were 
as follows: case history material; inter- 
view data obtained from parents, teach- 
ers, peer groups, and subjects; socio-eco- 
nomic class ratings; sociometric data; 
Rorschach test; Thematic Apperception 
"Test; personal inventory tests, such as the 
California Personality Test; essays on 
such topics as “The Person I'd Most Like 
to be Like"; and material on the develop- 
ment of moral conscience. A year after 
investigations were completed, the stim- 
ulus-material for this study was adminis- 
tered. A clinical case conference was held 
in each case, at which time there was a 
critical examination of the findings ob- 
tained by each technique (excluding the 
present ones) The summary transcripts 
of these conferences were used as validat- 
ing data on the present group of cases. 

The drawings completed by the chil- 
dren were analyzed without knowledge of 
more than their age and sex. Inquiry 
questions were not administered to this 
group. The interpretations derived from 
the drawings were rated with reference 
to degree of agreement with the clinical 


case conference conclusions. In each case, 
the investigator compared her interpre- 
tations with the independent clinical 
summaries. Each point of the present 
diagnoses was rated on degree of agree- 
ment with the validating data. The rat- 
ing system is described in a later section. 


Group No. 2: NORMAL ADULTS 

SUBGROUP A—VOCATIONAL GUIDANCE 
GROUP 

Description: There were two sub- 
groups making up the normal adult pop- 
ulation of this study. One consisted of 
people who applied for vocational guid- 
ance at a general psychological clinic. 
There were 13 men and 11 women, rang 
ing in age from 17 to 61 years. None of 
this group of subjects exhibited a psy 
chiatric syndrome, either clinically or 1n 
test findings. They are consequently re- 
garded as members of an_ essentially 
“normal” population. 

Method of Validation: All members of 
this subgroup were given a battery of 
tests investigating personality structure, 
including: the Rorschach Test, ТАТ, 
Szondi Test, and the California Person- 
ality Test. In addition, there was a pr? 
longed conference between the subject 
and a professional psychologist, during 
which the test results were discussed: 
This psychologist used the test findings 
and his notes on the conference as the 
basis for his rating of the writer's blind 
interpretations. As in the other groups 
each point of interpretation was com- 
pared with the validating data and rate 
on degree of agreement with these data- 


Group No. s: NORMAL ADULTS 

SUBGROUP B—VOLUNTEER NORMAL ADULT 
GROUP 

Description: 'The second subgrouP 
consisted of 30 adults, evenly divided a 
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to sex, and ranging in age from 18 to 49 
years. They had all had some amount of 
college training, and all had volunteered 
to take part in this study. None of this 
group had ever been required to seek 
psychiatric aid, and therefore they were 
considered to be members of a relatively 
“normal” population, Some members of 
this group were doing advanced work in 
psychology. None of them, however, indi- 
cated any familiarity with the theoret- 
ical implications of the drawing-task. 

Method of Validation: In many cases 
of the other groups (such as 3 and 4), the 
Interpretations were concerned with 
material which, because of their emo- 
tional conflicts, the subjects could not be 
expected to discuss. It was, therefore, con- 
sidered desirable to have a group of 
Normal subjects for whom conscious 
acceptance of the interpretations might 
be the criterion for validation. The anal- 
Yses were written in a manner providing 
Specific points for judgment. Generalities 
Such as "manner of handling anxiety" 
Were carefully avoided. As much as possi- 
ble, the subjects were asked non-leading 
questions, and encouraged to elaborate 
Upon their answers. 


Group No, g: CHILDREN WITH 
BEHAVIOR PROBLEMS 
Description: This group con 
18 children who were under treat’ 
for behavior disorders. They ranged in 
age from 6 to 16 years, and were nearly 
evenly divided as to sex. Their symptoms 
ranged from relatively mild to relatively 
more severe types. As in the case of the 
ш psychiatric group (see below) no 
SPecial account was taken of the indi- 
vidual diagnoses. These subjects were 
Sing treated by two therapist — 
Method of Validation: The 15 


sisted of 
tment 


t of 


interpretations based on the drawn ma- 
terial was submitted to the therapist 
treating the case. The therapist, using 
his clinical notes for reference, scored the 
interpretations in terms of the system 
previously described. 


Group No. 4: ADULTS IN PSYCHO- 
THERAPY 

Description: This group consisted of 
6o adults, men and women, ranging in 
age from 17 to 69. All of them were 
under treatment for psychiatric disorders 
at the time the drawing material was ad- 
ministered, These subjects were treated 
by three therapists. The diagnoses and 
symptoms of these patients varied from 
the mild neuroses to diagnosed schizo- 
phrenia. However, for the purpose of this 
stedy, no differentiation was made 
among the members of this group in 
terms of types or symptoms. The rela- 
tionship between specific disease types 
and responses to the stimulus pattern 
must be left to future research. The pur- 
pose of including these subjects is to test 
the hypothesis that the symbols are mean- 
ingful to people with emotional aberra- 
tions in the same way as they are to 
others. 

Method of Validation: The list of 
interpretations based on the drawn ma- 
terial was submitted to the therapist 
treating the case. The therapist, using his 
clinical notes for reference, scored the 


interpretations in terms of the system 


previously described. 


Group No. 5: INDIAN GROUP 
Description: This was a group of 
twelve children, all approximately thir- 
teen years old. There were eight girls 
and four boys. Seven of the children be- 
longed to the Navaho, Zuni, and Papago 
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tribes. These are all matricentric tribes. 
The others belonged to the Sioux tribe, 
which is patricentric. There are other 
differences, in that the Sioux are con- 
sidered to possess a disintegrating cul- 
ture, while the others are not. 

Method of Validation: 'These children 
were part of a group studied in 1945 in 
the Research on Indian Education of 
the Committee on Human Development. 
At that time, material was gathered in 
the form of autobiographical notes, case 
histories, T'AT, Rorschach tests, and free 
drawings. The process of using this ma- 
terial for validation purposes was the 
same as in the case of the normal chil- 
dren's group. 


D. SYSTEM FOR VALIDATION-RATING 


Seven rating categories were used vo 
indicate the relationship between the 
interpretations on the drawing materials 
and the validating material As previ- 
ously noted, these were Complete Cor- 
roboration, Partial Corroboration, Con- 
flicting Evidence, Partial Conflicting Evi- 
dence, Complete Contradiction, Partial 
Contradiction, and No Evidence. 

The same system was used for all the 
groups regardless of the source of validat- 
ing material. Fuller explanations of the 
rating categories are given below. Ex- 
amples taken at random from several 
different cases are included. These cases 
are all from the Normal Children's 
group. It will be recalled that validating 
statements were available for direct quo- 
tations from the transcripts of Rorschach, 
TAT, and other test records, and from 
conclusions of independent workers in 
a clinical case conference on each case. 

Plus (+ ) was used to indicate complete 
corroboration of the interpretation by 
the validating material. Only when the 
nature of the evidence consisted of an 


identical paraphrase of statements used 
ol H H 

in the blind interpretation was this scor- 

ing used. 


Examples of Plus Scoring 


1. Interpretation: 
"Handles anxiety generally through evasive 
generalization." 
Validation: 
"Anxiety motivates over-all adjustment pat- 
tern. Seen in all projective materials in a 
hesitancy, evasion, and cover-up pattern." 
(Summary transcript of clinical case con- 
ference) 
. Interpretation: 
"Sexual anxiety controlled: fantasy meth- 
ods." 
Validation: 
"Sexual anxiety controlled through fantasy." 
(Summary transcript of clinical case con- 
ference) 
3. Interpretation: 
"Finds security in companionship with 
those of same sex." 
Validation: 
"Personal achievement (sports) closely tied 
to sense of loyalty to group and need of 
recognition from the group." 
"Close identification with young men." 
"Plays with older boys on equal terms." 
"s. . He is aware of masculine peer group 
expectations and trying to live up to them. 
(All the above quotations are from the clin- 
ical case conference) 
4. Interpretation: 


5 


“Self-concept: very masculine." 
Validation: 


"Very high masculine score." (Masculinity- 
femininity profile) 
"Strong male self-concept." (Essay material) 
Parüial-Plus (Р-р) was used to indicate 
partial corroboration, that is, partial evidence of 
a corroboratory nature. In all cases where there 
Was some question as to whether a “+” OF ® 
“P+” was applicable, the “P4” was used. 


Examples of Partial-Plus Scoring 
1. Interpretation: 
“Playful attitude as method of adjusting 
to competitive heterosexual relations, al- 
Ways with controlled attitude." 
Validation: 
"Satisfactory adjustment to heterosexual 
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role on interest level.” /nterpretation of 
California Personality Test in clinical case 
conference) 
2. Interpretation: 
"Feels mother-rejected.” 
Validation: 
“Mother impersonal.” (Summary and inter- 
pretation on interview material in clinical 
case conference) 
3. Interpretation: 
“Oral fixation.” 
Validation: 
"Breast-fed for almost a year." (Case his- 
tory) 
4. Interpretation: 
“Heterosexual relations on a close basis are 
repulsive and threatening." 


Validation: 
"Possible passive homosexual adjustment." 


(TAT) 
"Low heterosexual interest." (TAT) 
Plus-Minus (+) was used to indicate conflicting 

evidence. Sometimes, in cases where the validat- 
ing material consisted of compilations of reports 
on the subjects by different workers, the amount 
of information present in the record was not 
sufficient to explain scemingly conflicting results 
from the different techniques employed. In such 
Cases it was assumed that we could not judge 
Whether the evidence supporting or contradict- 
ing Our interpretation. was correct. The "x" 
Sign was used to denote this condition. 


Examples of Plus-Minus Scoring 


1. Interpretation: 
"Positive attitude tow 
sexual contact." 

Validation: 
"Sceks transference from mother to 45-year- 
old neighbor woman with whom he gos 
fishing.” (Case history interpretations in 
clinical case conference) 
"Not interested in girls." ( 

2. Interpretation: 

"Hostility freely expressed." 
Validation: 
“Characterized by t 
history) 
"Considered. submissi 
(Sociometric) 


£ indicate 
Partial-Plus-Minus (P+) was used ° cg i 
Partial conflicting. evidence, that 15, putes 
dence Which at once supported and contra 1 


ard individual hetero- 


Case history) 


emper outbursts.” (Case 


ve and cooperative. 


the interpretations, when this evidence was in- 
conclusive in both directions. 


Examples of Partial-Plus-Minus Scoring 


1. Interpretation: 
“Hostility toward mother.” 


Validation: 
“Mother torn between need for control and 


laxness because of subject's charm.” (Case 
history interpretation in clinical case con- 
ference) 


Minus (—) was used to indicate contradictory 
evidence. 


Examples of Minus Scoring 


1. Interpretation: 
“Expects coldness from girls.” 
Validation: 
“Girls show interest in him.” (Sociometric) 
2. Interpretation: 
"Mildly compulsive in control of sexual 
anxiety: chief source of control in attitude 
o of repulsion." 
Validation: 
“No sexual anxiety.” (ТАТ) 


Partial-Minus (P—) was used to indicate par- 
tial contradiction, that is, evidence of a contra- 


dictory nature. 
Examples of Partial-Minus Scoring 

1. Interpretation: 
"Experimental mood with regard to com- 
petitive heterosexual situation." 

Validation: 

"Prognosis toward satisfactory heterosexual 
adjustment not favorable." (TAT) 


. Interpretation: 
“Sees male figures as sources of support." 


validation: 
“Does not play with boys of peer group 
very much." (Case history) (No informa- 
tion available on father or other non-peer 


ю 


males) 
The category, No Evidence (O), was used to 


indicate lack of evidence in either the positive or 
negative direction as to the validation of the 
interpretation. It was used when no statement 
could be found anywhere in the case record 
which shed light even remotely upon the hy- 
pothesis scored. As the examples above should 
make clear, conservatism was the rule through- 
out, and any statements which seemed to modify 
the scoring at all were included. This symbol 
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was also used when the subject did not answer 
the inquiry questions, and the content of his 
drawing was obscure. 

Further examples of how this scoring system 
was used may be found in Case A in the Appen- 
dix. 


E. CHECKS FOR RELIABILITY 


The checks for reliability of procedure 
varied with the subject groups because of 
the differences in method of validation. 
For this reason, the following discussion 
has been organized in terms of the sub- 
ject groups. Subsequently, reliability 
checks on the methods of this analysis 
will be discussed. 

The Normal Children Group.—In the 
validation process for this group the 
writer selected statements from the val- 
idating data which seemed to her to have 
а bearing on the specific interpretations 
made of the subjects’ responses to the 
stimulus material. This raised two prob- 
lems of reliability. The first is whether 
the writer's selection of validating ma- 
terial was reliable; and the second is 
whether the rating made on the extent 
of agreement between the interpretation 
and the validating statements was relia- 
ble. 

The reliability check of the writer's 
selection of validating statements from 
the validation data was made as follows: 
Questions were formed out of the inter- 
pretations of fifteen of the cases. Care 
was taken that the questions not be 
“leading.” An interpretation such as 
"Hostility freely expressed" was changed 
to "How is hostility handled?"; “Mother 
affectionate” was changed to “How does 
subject interpret mother's attitude?" An 
independent worker was then given these 
questions as set up, and asked to find, 
from the complete clinical case confer- 
ence data which had been gathered, 


statements, if any, answering the ques- 
tions. 

As seen in Table 1, the checker's ma- 
terial was identical with the writer's in 
84% of the cases. The reliability of the 
validating statements is underscored by 
the remaining breakdown. The cases in 
which the checker found evidence, but 
the writer had listed none, was 2%. 

The reliability check on the writer's 


TABLE т 


RELIABILITY OF VALIDATING STATEMENTS 
(Based on 215 Judgments) 


Category No. % 

Checker cited same material 181 84 

Checker found different material 4 2 
Checker found evidence where 

writer listed none 4 a 
Checker found no evidence where 

writer did 26 12 

Total 215 100 


ratings was to give another independent 
worker the interpretations made on each 
subject, with the validating statements 
selected from the clinical material by the 
writer, and to ask her to make her rating 
independently. Her scores were then 
compared with those made by the writer. 
Complete agreement was found in 04% 
of the 76 scorings which had been cal- 
culated. Differences based on more than 
a half scoring value? were found in only 
0.7% of the comparisons. 

Adult and Problem Children Groups.— 
In the Normal Adult-Vocational Guid- 
ance Group, the Psychiatric Case Adult 
Group, and the Problem Children Group: 
validation ratings of the interpretations 
were made entirely by independent 
workers. The reliability check made for 

*“A half scoring value" refers to the relation- 


ship between partial evidence and complete evt 
dence, eg., "P" and “+” 


— n ж 


k. di 
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4j TABLE 2 
RELIABILITY CHECK BY THERAPISTS 


% Change, % Change, % Change 


G 
тнр Agreement 3 Step Full Step from “о” 
Peydhiatric Adult Group: 
l'herapist A 8 

Therapist В 96 š : 

Therapist C . 82 5 4 9 
Normal Adults, Vocational Guidance Group 93 7 о о 
Problem Children's Group: 

Therapist A 98 I o I 

97 2 o 1 


Therapist B 


these groups, therefore, was to have the 
same workers rerate a representative 
sampling of each group after two months. 
Table з summarizes the results. The per- 
centages on this table express the ratio of 
agreement between the first and second 
ratings made in the several groups. These 
ratings were made by three therapists for 
the Psychiatric Adult Group, а clinical 
psychologist for the Normal Adult-Voca- 
tional Guidance Group, and two thera- 
pists for the Problem Children Group. 

For the Psychiatric Adult Group, the 
percentages of agreements between the 
first and second scorings were g8 per 
cent, 96 per cent and 82 per cent respec- 
tively. The last figure is lower than the 
Others because this therapist made more 
scoring the second time. 


y reversed his earlier 
геп SO, 


"no evidence" 
He also completel 
Scoring on some interpretations. Ev 
these high percentages of agreement per- 
mit confidence in the reliability of the 
Validation processes used for this group. 

For the Normal Adult-Vocational 
Guidance Group, the first and second 
Scorings were identical for 93 per cent 
of the interpretations. In the remaining 
7 per cent, there were no shifts greater 
than a half scoring value. These ratings 
are therefore considered to be thoroughly 


validated for the Normal Adult-Voca- 
tional Guidance Group. 

For the Problem Children Group, 
agreement between the first and second 
scorings was 98 per cent and 97 per cent 
respectively. This extraordinary reliabil- 
ity is probably explained by the fact that, 
in children, such factors as “oral-fixa- 
tion” or “peer-group relationship” are 
more readily checked. 

The Normal Adult Group was divided 
into two subgroups. One was the voca- 
tional Guidance Subgroup, validated by 
a clinical psychologist on the basis of 
clinical interviews, and standard test re- 
sults, The second group was interviewed 
by the writer, and thus validated directly. 
Reliability of ratings for the latter sub- 
group was checked in the following man- 
ner: The percentages were computed for 
the number of interpretations of subjects’ 
responses rated in the various categories. 
These figures were then compared for 
the two subgroups. Critical ratios were 
calculated for the percentages? in the 
“complete corroboration” category. The 
ratios ranged from 0.23 to 0.47. Since 
2.00 is the usual criterion of significant 
difference, these results indicate that 


з'Тһе percentages were expressed as propor- 
tions for this purpose. 
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there is very little difference between the 
two validation results. For this reason, 
the two subgroups are considered to- 
gether. 

Indian Group.—Since the validation 
method for the Indian Group was the 
same as that for the Normal Children 
Group, no new reliability checks seemed 
necessary. 

Method of analysis.—Two checks were 
made on the reliability of the method 
of analysis. The purpose of the first check 
was to discover the ease with which an- 
other worker might learn to analyze this 
kind of material. Ten records were given 
to this worker, to whom the frame of 
reference used in the study was carefully 
explained. He was asked to analyze these 
records, and his analyses were compared 
with those made independently by the 
writer. The essential points were anal- 
yzed in almost exactly the same manner 
by both workers. For example, one sub- 
ject drew a chest over the male figure 
pattern (Fig. 4). On inquiry, he described 
the chest as being "locked, but still a 
nice piece of furniture." One interpreta- 
tion stated, “Subject has positive attitude 
toward father, but finds him inaccessi- 
ble." The other said, "Subject is fond of 
father, but finds him hard to contact." 
No directly conflicting analyses were ob- 
tained. Judgments in which different 
points appeared to be emphasized were 
never found to be actually incompatible. 
As might be expected, the writer's previ- 
ous experience with such material en- 
abled her to make twice as many inter- 
pretations as were mentioned by the 
other analyst. However, in 96 per cent of 
the interpretations. made by the two 
analysts which referred to the same be- 
havior, there was no substantial differ- 


ence in the points they had made. This 
agreement probably indicates the value of 
the inquiry statements used in guiding 
these analyses. 

The second check dealt with the prob- 
lem of whether the analysis of a record 
was modified by the analyst forming no- 
tions about the case from the first few 
responses of a subject, and subsequently 
making interpretations on the rest of the 
subject's responses in terms of these pre- 
conceived notions. Another person was 
asked to select at random a set of eleven 
subject responses to the stimulus-patterns 
from the complete materials at hand. 
This set of eleven subject responses was 
thus a composite record, in which each 
subject response was taken from a differ- 
ent case. Then the writer analyzed this 
composite as though it were a single sub- 
ject’s responses. In the analysis of the 
composite record, one interpretation was 
made about each subject response. The 
interpretations were compared with those 
originally made on the individual sub- 
ject responses when they had been con- 
sidered in their context. All eleven inter- 
pretations made in the check had been 
given in the original analyses also. Con- 
sidering the eleven stimulus-responses, 
what seemed to be the central concept in 
each subject response formed the inter- 
pretation. All eleven interpretations 
proved to have the same central concepts 
previously connected with these subject 
responses. This check is possibly some- 
what inconclusive because the writer was 
aware that the setup was artificial. 
Nevertheless, this procedure has made it 
seem unlikely that the analytic process 
is much affected by  preconceptions 
formed in advance of the author's analy- 
sis of each case. 


В: 


CHAPTER ПІ 


ЅАМРІЕ ANALYSES 


SourcE OF MATERIAL 


D order to make clear just how the 
various principles discussed in Chap- 
ter II were actually applied, two records, 
completely analyzed, are presented with 
a step-by-step explanation of the method 
used. Case А is taken from the Normal 
Children Group, in which validation was 
by clinical conference records, including 
data from Rorschach, TAT, interviews, 
etc., as listed in full above (see Chapter 
II). No inquiry material was obtained in 
this group, except for the spontaneously 
given labels supplied by some subjects, 


as in this case. The reasoning used in 
arriving at interpretations is indicated. 
The quotations from the clinical confer- 
ence record relating to the interpreta- 
tions are fully presented. Case B is from 
the personal interview group. The com- 
plete inquiry is quoted with explana- 
tions of how this material was used in 
forming the interpretations. A verbatim 
account of the interview section is in- 


cluded. 
Case 4А: White Boy, Age 13 
The following case is an example of 
procedure in the blind analysis of a case 
in the Normal Children Group. It will be 
recalled that in this group no inquiry was 
administered. The material used for vali- 
dation has been more fully described 
above (see Chapter II). It is chiefly a sum- 
mary of independent analyses by other 
workers using several different instru- 
ments such as TAT, Rorschach, inter- 
view, self-rating questionnaires, etc. 
Below we present the analyses that were 
made, the rationale for the analyses, direct 
quotations (from the validating material) 


of such statements as seem to be pertinent 
and the scoring category into which the 
interpretation was placed. It is important 
to keep in mind the fact that no verbal 


associations were available in analyzing 
this record. 


SYMBOL I (Fig. 12). 


Analysis: 

Although without inquiry it is difficult to tell 
exactly what the effect connected with the draw- 
ing might be, it is assumed that the sun is 
probably a source of warmth and at the same 
time, a distant object so far as the subject is 
concerned. 

Two interpretations were formed here: 
© A. Mother fixated. 

B. But the mother scems remote to subject. 
Validation: 

А. Mother tie largely a fixation. (TAT) 

B.... Marked by not over-affectionate pro- 

tection. 

Early lack of contact with mother. (TAT) 

Uncertain of mother. (Rorschach) 
Scoring: 

Since the above quotations from the valida- 
tion material seem to be practically a paraphrase 
of what was stated in the blind analysis of the 
drawing material, both IA and IB were scored 


plus. 
SYMBOL 1I (Fig. 13). 
Analysis: 

It seems to be a safe guess that. the attitude 
toward а *Jollypop" suggested a positive ap- 
proach. The oral element seemed obvious, too. 
For these reasons the following analysis was 
made: 

A. Oral fixation. 

B. Positive attitude in individual heterosexual 

situation. 
Validation: 

A. Mother dependence and unfulfilled suc- 

corance needs. (Rorschach) 

B. Positive interest in heterosexual situation. 


(Rorschach) 
Sexual interests that are developing. (TAT) 
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E 


Fic. 12. Case A: Symbol I. 


Scoring: 

"Unfulfilled succorance" seems to imply early 
deprivation in the context of mother depen. 
dency. For this reason it was felt that the state- 
ment from the validation material supported the 
likelihood of an oral fixation in this subject, а1- 
though it hardly proved the matter. A “partial- 
plus" was the scoring used here. 


*Term used in clinical conference to describe 
desire for passive role in which needs are sup- 
plied by an outside source. 


The hypothesis that there was a positive atti- 
tude toward heterosexual situations seemed 
fully Supported by the validation material, and 
the scoring for this hypothesis was plus. 


SYMBOL III. (Fig. 14). 


Analysis: 


The fact that the Subject drew a good figure 
Out of the symbol without changing its essential 
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Fic. 13. Case A: Symbol II. 
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Fic. 14. Case A: Symbol III. 


form seemed to indicate an acceptance of his Validation: 
father's masculinity on a positive basis. The fact 
that a lightpost was chosen would seem to indi- 
cate that the boy tried to associate guidance 
with the father figure. The analysis was written В, 
up as follows: 

A. Tries to identify with father. 

B. Father is the source of standards. 


A. Father first person mentioned on list of 
"people I'd like to be like," 
Strives to identify with father. (TAT) 

‚ Father source of strict, uncompromising 
moral code to which subject shows resist- 


ance but probably will internalize. (Ror- 
schach) 


= 
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a meal pg it 


Fic. 15. Case A: Symbol IV. 


Father man of firm demands which subject 


tries to obey. (TAT) 


Scoring: 

It seems entirely justified to assume th 
validating statement completely corroborated the 
hypothesis. The scoring therefore was plus for 
both statements. 


at the 


SYMBOL IV (Fig. 15)- 
Analysis: 

It seems likely that the control of sexual 
anxiety is somehow connected with display of 
force. The following analysis was made. 


A. Sexual anxiety controlled through display 
of force. 
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Fic. 16. Case A: Symbol V. 


Validation: 

A. Sent anonymous notes to older woman 
demanding she meet him in shack. (Case 
history) Insisted upon identification that 
no harm was meant. (Case history) 

Scoring: 

Because the only evidence in the record is 
but remotely related to the interpretation, and 
it is conflictual, the scoring was “partial-plus- 
minus (P+).” 

SYMBOL V (FIG. 16). 
Analysis: 

It did not seem far-fetched to assume that a 
barbed wire fence is connected with environ- 
mental restriction. Analysis was written as fol- 
lows: 

A. Much anxiety concerning environmental 

restrictions, 
Validation: 

A. First and foremost, subject struggled with 
emotional problem of dependence and in- 
dependence. (Rorschach) 


Anxiety chiefly in terms of negativism, more 
the situation-reactive kind, rather than 
some deep-seated disturbance . . . is source 
of guilt over resistance to parental control. 
(TAT) 

Scoring: 

The quotations from the validation material 
seemed in this instance to justify a full plus 
scoring for the hypothetical analysis. 


SYMBOL VI (Fig. 17). 
Analysis: 


The only thing of which one can be sure, and 
then only relatively so, in this case, is that there 
is a transitory quality expressed in the drawing. 
For this reason, the following analysis was made: 

A. Finds only transitory contact and no deep 

relations possible with other boys. 
Validation: 
^. Negative feeling of group toward him . . - 
he makes aggressive attempts to join groups. 
(Sociometric analysis) 


| 
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Жал EE КЕ Я 


Fic. 17. Case A: Symbol VI. 
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R pac of mette] glosi | 


Fic. 18. Case A: Symbol VII. 


y 
NE 


Fic. 19. Case A: Symbol VIII. 


СА 
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Scoring: o 
"The evidence in this case seem to warrant а 
full plus scoring. 


SYMBOL VII (Fig. 18). 
Analysis: 

Without having more information from the 
Subject, one might connect glasses with a wish 
for "better vision," and thus with a desire for 
better understanding. (We often say: “I see" 
when we mean: "I understand") There is also 
a need for shielding or protection symbolized by 
glasses, and thus we hypothecate a withdrawal 
reaction too. The statements made in the analy- 
sis of this drawing were as follows: 

A. Striving for greater understanding in ori- 

entation to opposite sex. 

B. Also has defensive withdrawal reaction with 

girls. 
Validation: 

No statements were found wl 
subject's attitude toward women in general or, 
particularly, the female members of his peer 
group. 

Scoring: 
The scoring here was ^40: 


hich related to the 


SYMBOL VIII (Fig. 19). 
Analy 
Closing the figure in this symbol has scemed, 
in our experience, to indicate a regressive per- 
sonality trend. Also, though it is not clear in 
the reproduction, the subject wrote "care" at 
first, and then erased the "e." The slip made 
in the spelling of the word "car" seemed to 
justify hypothecating a dependent attitude on 
the part of the subject. The analysis was as 


follows: 
А. Regressive and dependent. 
Validation: 
A. Strong childlike emotional dependency. 
(Rorschach 
Scoring: 


n from the validation. material 


The quotatior » 
f the inter- 


scemed to furnish corroboration О 
pretation, The scoring was “plus.” 


SYMBOL IX (Fig. 20). 


Analysis: 
From this drawing, even though it was labeled 
ossible without 


as it was, no analysis seemed р 
further information, except as а complete guess. 
Therefore, no analysis was attempted. 


SYMBOL X (Fig. 21). 
Analysis: 

Without further information, it seemed that 
only a statement of the passivity of the subject's 
personality could be made. A relationship has 
been noted between the drawing of vessels of 
various sorts on this symbol, and a passive atti- 
tude. The following statement was made about 
this symbol: 

А. Passive personality. 

Validation: 

A. Aggressive, argumentative, although peers 
feels his aggressiveness is inconsistent. 
(Case history and sociometric material) 

B. Emotionally dependent, (Rorschach) 
Scoring: 

"There does not seem to be a statement directly 
bearing on the question of passivity, and so both 
of the above quoted statements must be accepted, 
even though they are contradictory. The scoring 
was "plus-minus (=).” 

SYMBOL XI (Fig. 
Analysis: 

The general crudeness of the drawing plus the 
resistive implications of "crossed legs" seem to 
warrant the following analysis: 

A. Feels incompetent in competitive hetero- 

sexual situations. 

B. Feigns indifference to such situations. 
Validation: 

No evidence to be found bearing on the sub- 
ject’s relationships in the competitive hetero- 
sexual situation. 

This case illustrates analysis by content alonc. 
The contrast with the following case makes it 
clear how useful the verbal association material 
is in gaining understanding of the subject’s draw- 


ing responses. 
Case B: White Female, Age 24 


This subject was a member of the 
Normal Adult Subgroup B (interview 
group). She had been married for five 
years. When she volunteered for this 
study, her companion stated to the 
writer, “You now have in your group 
that rare thing, a really model wife.” It 
might also be added that she was a per- 
son of very superior intelligence, a leader- 
type who had proved her ability in a 
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Me polk ا‎ 


Fic. 20. Case А: Symbol IX. 
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Fic. 21. Case А: Symbol X. 
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2. Case А: Symbol XI. 
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number of ways. This is especially im- 
portant because some of tlie questions 
asked might at times appear like leading 
questions, Knowing this personality is to 
realize that she was not suggestible, and 
possessed the facility to see quickly 
through and into a question asked. 

The subject was given the directions 
described in Chapter II. She finished the 
drawings unusually quickly, taking little 
more than a half an hour for the entire 
performance. Then she was given the in- 
quiry questions which she proceeded to 
fill out in the way in which they appear 
on the copies of the drawings. The fol- 
lowing pages describe the process of analy- 
sis in detail. 

Included in the interpretations that 
follow is a verbatim account of the inter- 
view with the subject. 

Following is the list of inquiry ques- 
tions; these may be used as a guide in 
Interpreting the written remarks of the 
Subject, which were recorded after she 
had completed each drawing. 

A. What is this a drawing of? 

. Was it relatively easy or hard to draw? 


B 
C. What mood does it represent? 
D. Describe in detail the personalities of any 
people or animals depicted: 
1. What are they doing? 
2. What are they thinking OT 
. What mood do you feel as you 
‚ Do you like it? 
G. What does the drawing remind you 


feeling? 
look at it? 


E, 
F. 
of? 


Interpretation is based upon the an- 
Swers to these questions. 


SYMBOL I. Semi-circle Pattern (Fig. 23)- 


Interpretation 1: 
The fact that all the answers to the inquiry 


Questions expressed positive affect was inter- 
attitude toward the 


Preted as meaning a positive 
mother. 


Q. “Are you particularly 


A perhaps closer to her than to 
** "Oh, yes!” 


fond of your mother, 
your father?" 


This emphatic reply seemed to be sufficient 
for validation purposes. 

Interpretation 2: 

Since the subject labelled the structure drawn 
from stimulus-pattern "A library," it was as- 
sumed that there might be an interest in intel- 
lectual pursuits which was stimulated by the 
mother. This likelihood, of course, depended on 
the accuracy of the first interpretation that the 
subject had an affectionate regard for her 
mother. 

Q. "Do you share interests with either parent 
particularly?" 

A. "Well. My father is as intellectual as my 
mother . . . he is a teacher and writer, but 
my own interests are definitely like my 
mother's, especially in politics. She always 
used to guide my reading." 

The fact that the subject took pains to estab- 
lish that she might have taken her intellectual 
cues from her father but did not is evident. It is 
interesting that she placed her answer in an 
intellectual context also, even though the ques 
цо т had expected to have to ask a second 
question in order to validate her interpretation. 
Interpretation 9: 

тһе fact that the subject reports her own 
mood as being “nostalgia” when looking at the 
drawing suggests the interpretation that she and 
her mother are not as close as they once were. 
Q. “Have you and your mother been as close 

lately as you once were?” 

A. “Oh, no. We still live next door to each 
other, but for the past year or so—since she 
has opened her shop—I hardly see her at all.” 

The subject's tone as she said the above was 
hat depressed, it seemed to the questioner. 
it was clear from her answer that 
d the interpretation. 


somew! 
In any case, 
the facts validate 
SYMBOL II. The Semi-circle and Single Line 
Pattern (Fig. 24). 


Interpretation M 
The inquiry indicates positive affect. It is also 


noteworthy, however, that the mood of the pic- 
ture is called "relaxation," and the subject says 
that "everything is taking it easy." The inter- 
pretation made was that the subject had a posi- 
tive but passive attitude toward intimate hetero- 
sexual relations. Since she is married, it was as- 
sumed that this attitude was reflected in her 


relations with her husband. | 
“How would you characterize your relations 


with your husband?" 
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. A public building. 

Easy. 

Exhilaration. 

No particular personality. 
. Nostalgia. 
aes. 

. A library. 


M шщ юз 


wr 


Fic. 23. Case B: Symbol I. 
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Fic. 24. Case B: Symbol П. 


ESI 


. Everything 
. Solitude. 


LIE 


A terrace. 


Easy. 
Relaxation. 


Very much. 


. Summertime. 


is taking it easy. 
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Еш. 25. Case B: Symbol III. 


sy. 
Gaiety. 
Restraint (personality of the flower). 


= ور م چ پو = 


. Amy Lowell's “Patterns.” 


A. “Oh, very pleasant.” 

At this point it seemed that a more direct 
form of question could not be avoided. 

О. "Your relations with your husband are satis- 
fying, but more characterized by a feeling of 
tenderness perhaps rather than 'purple pas- 
sion'?" 

"Yes. How did you get that?" 

The object of this research is to test whether 

this kind of material can be used in this 

way." 


< 


SYMBOL III. The Single Straight Line Pat- 
tern (Fig. 25). 


Interpretation. 1: 

From the fact that the subject expressed lik- 
ing on the first figure and negative affect on 
this one, it was deduced that the subject's rcla- 
tions were less adequate with her father than 
with her mother. 

Q. “Could you say which parent you prefer?” 

A. Easily. It's a funny thing, but you sce—my 
father was disappointed. He wanted a boy. 
I'm afraid that he never did come to feel 
very good about me. So I could hardly be as 
close to him as to Mom. Of course, things 
are pretty cheerful on the surface with my 
father, too.” 

This response seems to be acceptable as 4 
confirmation of the interpretation offered. 
Interpretation 2: ч 

The fact that the content was a flower in this 
drawing seemed to be significant. Whether or not 
one might give gender to a flower? it seemed 
safe to consider such a drawing on the “male” 
ness” stimulus as depicting a relatively “weak 
figure. 

Q. “Which parent would you say was the morc 
forceful in your family?" 
A. "My mother has always 

family." 

This answer seems to supply sufficient con- 
firmation for the interpretation. 

Interpretation 3: 

Since the “personality of the flower" was соп” 
sidered to be “restraint” (Question 4 оп the 
inquiry), it was supposed that the same charac: 
teristic would be true of the personality of the 
subject's father. 

Q. “Would you say that your father's attitude 
toward you was one of restraint?” 
A. “I should say so!" r 

It is obvious that more might have been said 
in this analysis, particularly with regard to the 
reason for the mood of gaiety and the poem 
mentioned in Question 7 in the inquiry. How 
ever, as has been explained above, no more 
than three hypotheses were made apropos of 
any one symbol. Possible connections with hus 
band were not discussed. 


dominated the 


SYMBOL IV. The Inverted "V" Pattern (FiS 
26). 
Interpretation 1: 


i CTS " H m 
Since all the associations on the inquiry see" 


2 ive 
In the vernacular of the street, a passiv 
male homosexual is referred to as a “pansy: 
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QS 


Fic, 26. Са 


se B: symbol IV. 


- Woman nursing а headache. 
. Hard. 

Torment. 

. She із wearied with nu 
` Anguish, 

No. 

* My mother-in-law. 


dges and occupies her time with trading neighborly pettinesses. 
i es 
rsing gru 8 


г‏ ج ېو ي 
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4 


Fic. 27. Case B: Symbol V. 


A tornado. 
. Easy. 
. Fear. 


PII 


- The man, like Per Hansen, is a Stocky, broad-shouldered man; the woman is a stocky broad—he is 


concerned about getting his family to safety; she is wondering which of the household goods she 


should trundle along. 
. Determination. 
Yes. 
. Kansas. 


л 


IO: 


to refer to the subject’s mother-in-law in a nega- 
tive manner, it was assumed that there must 
have been conflict which affected the subject's 
sexual adjustment. The reaction to the symbol 
for sexual anxiety is not always interpreted in 
reference to intimate heterosexual relations. In 
this case, this seems to be the most likely inter- 
pretation. 


& 


А. 


> О 


- "You seemed somewhat wistful when 79и 


agreed that ‘purple passion’ does not сһагас- 
terize your relationship with your husband. 
To what do you attribute its absence?” 

"I don't know . . . I guess the story-book 
phase of our honeymoon is over.” 


- "How has it changed?” 
+ "Well, sometimes . . . I don't know, I get the 


SYMBOL V. The Diffuse 
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SYMBO! 


. Two fencers. 
Easy. 
Romance. 
Stock characters out of 
Excitement. 
Yes. 
` The Beggars Opera. 


بج چو ب 


feeling that he isn't all mine. you'll probably 
eat this up, but sometimes I feel there’s too 
much mother-in-law. Now, don't get me 
wrong, I do like her. 1 mean she's all right— 
you know? It's just that she can't let him 
alone. Sometimes I think he' 
her than to me. I'm trying to change that, 
but he is having а hard time adjusting to 
being the loser. Oh, well, what else do you 
want to ask me?" 


s more tied to 


Mass Pattern (Fig. 27). 


Interpretation 1: 


The combination of the attitude of “deter- 


mination” 
the context of th 
to the man and to t 
was determined to seck relief from the anxiety 
created by marital conflict. 

Q. “Have you had serious concern for the future 


Fic. 28. Case B: Symbol VI. 


L No. 6. THE CROSSLINES PATTERN 


a Basil Rathbone movie. 


with the scene of strife suggested, in 
е reactions previously ascribed 
he woman, that the subject 


of your marriage, I mean, really, for a period 
of time?" 


A. "Yes, when my husband returned from serv- 


ice I was quite sure that I could not change 
the plans that I had made to suit some of 
his new ideas. I don't know how he put up 
with me then. I used to pout all day long. 
However, there isn't any doubt about our 
staying together now." 
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Interpretation 2: 

Since it was assumed that the people in the 
drawing were the subject's husband and herself, 
the interpretation was made that the characteris- 
tics mentioned in Question 4 of the inquiry 
applied to them. 

O. "How dependable would you say your hus- 
band is?" 


A. "He's dependable if he's nothing else." 

О. "How would you compare your own reac- 
tions under stress with his?" 

A. “Well, it’s funny; he can be awfully silly. But 


when things get really tough, he's the one 

who considers the important values." 

. "How do you mean?" 

. "Well, coming back to our marriage, he's the 
one who kept us together when superficial 
concerns almost made me run away. God! 
But I was miserable! 

Well, let's go on to the next question." 
Interpretation 3: 

Continuing the logic of Interpretation 2, it was 
assumed that the subject considered herself to 
be materialistic, wanton. ý 
Q. “Of course. But I'm afraid I must ask you an- 

other question on this topic. Which values 
do you feel you tend to emphasize in con- 
trast to your husband, and is this anything 
that bothers you?” 

A. “Well, he's not materialistic like I am. I 
need pretty things, for example, but he 
doesn’t understand it. It still worries me at 
times but what can I do about it? He's right, 
I suppose, but T'm the way I am." 

О. "Sometimes you fear you аге very materialis- 
tic and—um—somewhat wanton?” 

A. "Yes, I'm afraid so. There isn't much doubt 
about that." 

The last interpretation, although accepted by 
the subject, was not suggested to her in a way 
which gave her much freedom for discussion. It 
was considered to be only a "partial-plus" vali- 
dation. However, the importance of her feeling 
that she tended to be materialistic in an anxiety- 
provoking situation is clearly confirmed. 


> О 


SYMBOL VI. The Male Group Symbol (Fig. 28). 
Interpretation 1: 

It seems practically а paraphrase rather than 
an interpretation of the inquiry statements to 
say that the subject sees men in terms of 
romance and excitement, particularly in terms 
of conflict over her. 

О. "How do you feel about the idea of men 
fighting over you?" 


A. "That's semething any girl would enjoy, 
isn't it?" 
О. "Well, of course, now that you're marricd 
and ‘out of bounds,’ so to speak, there proba- 
bly isn’t much of a likelihood that men are 
actually fighting over you?" 
“Oh, I would hardly say that they fight over 
me. But I'm not exactly a wall-flower either. 
Like at parties, I know that men still try to 
outdo each other to get my attention. That's 
what makes parties fun, don't you think?" 
It may seem "natural," offhand, to feel as the 
subject does. However, many women do not fcel 
similarly and some суеп disapprove of such an 
attitude. Thus, this interpretation is adequately 
specific for a particular subject. 


SYMBOL VII. The Female Group Symbol (Fig. 

29). 

Interpretation 1: 

The positive affect expressed on the inquiry 
suggested that the subject took a generally posi- 
tive attitude toward other women. 

Q. "How do you get along with other women 
usually?" 

A. "Oh, I certainly enjoy working with the girls 
in the office and, of course, my best friends 
have always been girls. I know some women 
just don't like other women, but I can't see 
their point." 

Interpretation 2: 

The characterization in Question 4 of the 
inquiry suggested that the subject had a tend- 
ency to feel superior to most women, her positive 
attitude notwithstanding. 

О. “What do you think about women as a group» 
I mean in the sense of their being alert, and 
so forth?" 

A. "It isn't that men are really superior, bu! 
most women get pretty stodgy—in a pleasant 
Sort of way. But that's not because they 
couldn't be otherwise. I certainly wouldn't let 
myself go the way some woman can, and 
seem to enjoy it." 

Interpretation 3: 

The fact that the description in Question 4 
of the inquiry seems to set the subject apart 
from the rest of the female group lent itsel 
further to the interpretation that the subject had 
а resistance to pregnancy based on her unvill- 
ingness to identify with the female group. 3 
О. "How do you feel about having children? 
A. "Oh, Тт strictly a career-girl type, 1" 

afraid: No part of housewifery appeals to me: 

This ‘babies and bridge-game’ life most of 


SYMBOL ELABORATION TEST 45 


= Fic. 29. Case B: Symbol VII. 


1. Two pregnant women. 

2. Easy. 

3. Placidity. 

4. Naive rather conven 
figure, very happy to 

+ Happiness. 

Yes. 

- Helen Hokinson. 


astidious enough to keep their skirt hems even despite their 


tional women, Ё 
n to. 


do just as Jesus tells them 


won 
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Fic. зо. Case B: Symbol VIII. 


1. А bulls-eyc. 
2. Easy. 

3. Achievement. 
4. None. 

5. Triumph. 

6. Yes. 

7 


. A round peg in a round hole. 


my girl friends have compromised on is not 
for me. And the way some men like to tie 
their women down with babies! I'll have a 
baby when the man can do the suffering for 


me. 


SYMBOL VIII. The Regression Symbol (Fig. 30). 
Interpretation 1: 
The fact that return to the inner circle is 


characterized as being a source of triumph, and 


is associated with reaching a goal in the inquiry: 

seemed to indicate that the subject is very much 

security-conscious. 

О. "How important is security to you?" 

A. "As a matter of fact, I’ve turned down 2 
couple of agency offers quite recently, even 
though they would pay so much better, Þe- 
cause I prefer staying at S——-—where I know 


D c ————— ا ا‎ ee E E eS 
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Fic. 31. Case B: Symbol IX. 


„ Man walking, with a cane. 

. Hard. 

. Doggedness. 

. A man of taste, set in his ow 
. Stubbornness. 

. No. 

7. А map 


Фән =‏ ېو 


exactly what my future will be. I guess that 


answers your question." 

This is particularly interesting since it has 
been learned that the subject's husband is em- 
Ployed as an engincer in what seems to be à 
relatively substantial position. 


SYMBOL IX. Aggression Symbol (Fig. 37). 
Interpretation 1: 

The fact that the stimulus-pattern was well 
Structured was interpreted to mean that the 
Subject does not express hostility explosively. 
Q. "Are you considered to be a person VIELE 

much of a temper?" 


n ways, indifferent to criticism, loving cities. 


A. "No I don't think that there is anything to 
be gained from outbursts.” 

The problem of ego-involvement might seem 
worthy of consideration in evaluating the last 
response. However, people who are quick to 
anger are often as proud of their spontaneity as 
others are of self-control. 

Interpretation 2: 

The characterization of the figure stated in 
Question 4 of the inquiry was interpreted as 
meaning that, when this subject was confronted 
by pressure to conform, she reacted quietly but 
with confidence in pursuing her own pattern, 

Q. “How do you feel when situations seem to re- 


WOU о م‎ = 


JOHANNA KROUT 


Fic. 32. Case B: Symbol X. 


. Flies around a brandy glass. 


Easy. 


. Indifference. 


None. 


. Hopelessness. 


No. 


. Short-lived love affairs. 


SYMBOL ELABORATION TEST 49 


Fic. 33. Case B: Symbol XI. 


!- Canoe on a moonlit lake. 
2. Easy. 
3. Weirdness. redi: ering the magic usually captured on calendar pictures, 
4. He loves solitude, is not above rediscovering the тав y 
will come home hungry and go to bed. 
5. Pleasurc. 
6. Yes. 
7. Cool summer nights. 
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quire that you change yourself to suit the 
desires of others?" 

A. "You mean, was I one of the first to change 
to the 'new look'? Well, seriously, that was 
the thing that kept me from joining the WAC 
during the war. I am afraid I get pretty 
stubborn about liking things to be my own 
way. I don't care to tell others what to do, 
but one thing I just don't pay any attention 
to is their trying to dictate to me." 


SYMBOL X. The Self-Concept (Fig. 32). 
Interpretation 1: 

The conception of a passive container holding 
something desirable which others seek, like flics 
near brandy, suggests that the subject sees her- 
self as someone passive, and yet sought after. 
Passivity is further emphasized by the answer to 
Question 3 on the inquiry, "mood of indiffer- 
ence.” 

О. “Generally speaking, do you consider your- 
self to be a ‘go-getter’ sort of person or not?” 

А. "Uh uh! I'm a lazy-Susan. But I'm pretty 
lucky about things coming to mc. Maybe 
that’s why I don't really have to push my- 
self too hard." 

The last remark made further questioning 
unnecessary since it confirmed the entire inter- 
pretation, 


SYMBOL XI. The Competitive Heterosexual 
Symbol (Fig. 33). 
Interpretation 1: 

From the fact that “solitude” appears only in 
connection with this symbol and Symbol 2 (indi- 
vidual heterosexual situation), it seemed that 
loneliness, rather than seclusion, was the conno- 


tation. (The latter was the interpretation on the 

basis of the inquiry in Symbol 2 alone.) Because 

the subject contrasts the mood of solitude with 
the attitude expressed outside of the home in 

Question 4 of the inquiry, a lack of emotional 

rapport with her husband seemed likely. 

Q. "In intimate situations with your husband, 
have you sometimes felt as though he does 
not share your feelings?” 

A. "Well, I'm not sure that I understand you. 
But most of the time I feel left out of things, 
as far as he is concerned. But I think that is 
just because we are so different." 

Interpretation э: 

The substance of the answer to Question 4 on 
the inquiry seemed to mean that the subject had 
desired to experience extra-marital relations, but 
resisted; since the "magic" usually captured on 
calendar pictures scems to have a definite roman- 
tic and sexual connotation, and the subject states 
that "the appetite" is not satisfied. 

О. "Have you ever come close to finding out 
whether you could respond more completely 
to another man, I mean other than your 
husband?" 

A. "Well, I haven't been unfaithful, if that's 
what you mean—although that's no promise 
for the future! (Laughs) Seriously though, I 
did plan one time, but I didn't go through 
with it." 

The interpretation of this case illustrates the 
interpretation of the manner of drawing, the 
content, and the verbal associations of the sub- 
ject. The verbal associations, particularly, helped 
in the successful analysis of this case. There was 
one “P+.” rating. The rest of the interpretations 
were fully corroborated by the subject's remarks. 


CO а 


° CHAPTER IV 


RESULTS 


А. ANALYsIS OF TOTAL RESULTS 


lc results obtained in this study will 
Е be analyzed first in terms of the scor- 
ing categories. The object of this ap- 
proach will be to learn whether the stim- 
ulus-patterns gave rise to interpretations 
Which could be substantiated іп a signifi- 
cant percentage of the cases. The second 
approach will aim to discover whether the 
Stimulus-patterns were equally subject 
to validation. Attention will then be 
given to specific factors which may have 


a bearing on the results: the factors of 
sex, emotional stability, sequence of pres- 
entation, age, and cultural background. 

The data, in terms of the scoring cate- 
gories, are summarized in Table 3. АП of 
the one hundred sixty-nine subjects are 
included. No more than three interpre- 
tations were made on a given subject- 
response. The average number was a 
total of twelve for the set of eleven stimu- 
lus-patterns. The 2,265 interpretations 
of all the subjects’ responses served as 


TABLE 3 


INTERPRETATIONS FOR THE 


VARIOUS STIMULUS-PATTERNS IN TERMS OF 


THE RATING-CATEGORIES: ALL CASES 


N=169 e 
Interpretations — 2,265 
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the basis for the figures presented. 

'The percentage of interpretations for 
a given symbol which fell into each scor- 
ing category is shown. The first column 
contains the percentage of interpretative 
statements on which no evidence could 
be found in the validating material. 'The 
rest of the percentages were computed on 
the basis of the number of statements 
on which either supporting or negating 
evidence could be found. Thus it is pos- 
sible to see the extent of agreement be- 
tween the validating material and the 
interpretative statements for each sym- 
bol. From this table we can compare the 
relative frequency of the different scor- 
ing categories. 

It is apparent on inspection of Table 
3 that all the symbols offer confirmable 
interpretations in an overwhelming per- 
centage of the cases. 'The percentages in 
the "plus" column range from 85.4% to 
94.7%. The size of the percentages as- 
sumes further importance in the fact 
that the standard errors of those in the 
“plus” column are all less than one-thir- 
teenth of the proportion, and in most 
cases equal about one thirty-fifth of the 
proportion. This makes the probability 
of finding complete agreement for inter- 
pretations based upon the subjects' re- 
sponses very high. 

Considering the data in terms of nega- 
tive probability (see “minus” column), it 
appears from Table 3 that interpreta- 
tions based on the straight-line (Fig. 4) 
pattern were most often contradicted by 
the validating material Here the con- 
tradicted statements amount to 8.975; 
the partially contradicted statements 
amount to 1.4%; and those for which 
there was both supporting and contradic- 
tory evidence amount to 0.5%. If we mul- 
tiply the sigmas of each proportion by 
three and add these numbers to the 


о TABLE 3A 


INTERPRETATIVE STATEMENTS FOR EACH 
SYMBOL FALLING INTO THE VARIOUS 
SconiNG Groups: ALL CASES 


Scoring Categories Per cent of Cases 
Used for in Each Scoring 
Validation Category 
o o- 44 
50-100 
PM o- 53 
ate о- 9 
IS 
P+ о- 0.3 
_ о- 30.7 
P- o- I1 


original percentages, we have for each 
case the maximum percentage likely to 
occur in the negative results (counting 
the “plus-minus” group on a half basis). 
This computation would indicate а 
probable maximum of 18.98%. Thus the 
most extreme case of lack of support still 
leaves a high degree of confidence in the 
reliability and validity of the procedure 
involved in this test. 

The effect of computing percentages 
without counting the cases in which no 
evidence was available is of interest to 
us. A glance at Table g indicates that the 
highest figure in the "No Evidence" 
column was 20.8%. Since this means 
that, even here, judgments could be ren- 
dered in 145 cases, it is evident that an 
adequate sample was employed. 

The ranges in Table 3A initiated to 
show the spread of scoring categories for 
individual cases, invite caution. There 
was only one case in which 30.7% of the 
judgments were contradicted, the re 
maining statements having been wholly 
corroborated. It is noteworthy too that 
the lowest figure of plus-ratings in any 
one case was 50%. There was partial 
corroboration in 32% of the plus-rating 
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TABLE 4 


INTERPRETATIONS IN THE VARIOUS RariNG-CATEGORIES IN TERMS OF 
THE STIMULUS-PATTERNS: ALL CASES 


° 


Complete Partial Partial Сопйїсї- Total 
Symbol Corrobo- | Corrobo- Dunes Contra- i Total M 
ration ration CERTA diction Evidence | Number | Without 
Wer Тр" “pt "к" or 
ч 13 203 12 18 1 2 249 236 
Semi-circle 4.6 11.6 14.6 14.8 6.2 11.2 11.0 11.9 
o.8 3-9 3.2 б.о 7.0 
Semi-circle & No. 19 160 4 14 o 4 201 182 
single li o, 6.7 9.2 4.9 11.5 22.2 8.9 9.2 
ее sÈ 9:1 HE 219 E 
: No. as 182 8 19 3 1 238 213 
Straight line % 8.8 10.4 9.8 15.6 18.8 5:6 10.5 10.7 
SE 9.7 3.3 3-3 9.8 5-6 
No. 38 152 7 1 o 207 169 
Inverted "Ув" % 13.5 8.7 8.5 1-4 6.2 9.1 8.5 
ЅЕ 0.7 3.0 2.3 6.0 
" No. 22 152 3 4 1 x 183 161 
Diffuse mass o TS 8.7 3.7 3-3 6.2 5.6 8.1 Вт 
SE 1.0 843 1.5 6.0 5.6 
No. 4 146 9 7 4 2 202 168 
Crosslines [A slo 8.4 10.9 5-7 25.0 11.2 9.0 8.5 
SE o.8 3.5 2.1 11.0 o 
Counter-opposed| Хо" 38 131 5% 5 Я us P 
semi-circles sk iid Za i % 2.3 А 
—1* 
Semi-circle No. 19 144 4 12 1 4 184 165 
within larger % 6.7 8.2 4.9 9.8 6.2 22.2 8.1 .3 
semi-circle SE 0.3 2.2 2.7 6.0 9.7 
No. 23 153 10 Б І 2 197 174 
Jagged line [A 8.2 8.8 12.2 6.6 6.2 11.2 8.7 8.8 
SE 0.7 3-4 2.3 6.0 7 
No. 21 178 9 II 1 222 201 
Incomplete"U" | 92. < а 10. .o 6.2 11.2 0.8 10.1 
3 sk = H $:6 6.0 7.0 
р 0.7 3.3 2. B 
i No. " 10 12 » 3 o 199 169 
Lines and semi- | 0° | 1990 $t 12.2 9.8 18.8 8.8 8.5 
circles Zo. 
SE о.б 3-4 жеу. 9.8 
теш 282 1745 82 122 16 18 2265 1983 


judgments. The highest percentage of no 
evidence ratings occurred in the Indian 
group, and in one case it was as high as 
44%. The special conditions obtaining 
here have been described elsewhere. 
However, more than half the judgments 
could still be evaluated, even in the case 
of this individual. 

Table 4 deals with other problems 
Which the total data help us solve. Here 
the percentages have been computed so 
аз to compare each symbol with each of 
the others in terms of the contribution 1t 
makes to the total number of each cate- 
Bory. By comparing the percentage in 


each column with the percentages in the 
last (or total) column we can see the rela- 
tionship between this contribution and 


the percentage of interpretations indi- 


cated for each symbol. 

The most striking fact which this table 
offers is that all the symbols are equally 
valid. The percentages in the “plus-col- 
umn” are practically identical with those 
in the last. In other words, while the 
symbols vary slightly in the number of 
interpretations to which they readily 
lend themselves in an individual record, 
there is practically no difference between 
their likelihood to produce a given num- 
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ber of interpretations and the likelihood 
of these interpretations' being wholly 
substantiated. Variations exist in the 
other categories, but they do not affect 
this relationship. 

Some of these variations are interesting 
to examine. For example, the "female- 
ness" pattern (Fig. 1) was the easiest pat- 
tern to rclate to the validating material 
(i.e, had the lowest per cent in the “О” 
or "no evidence" column). This is proba- 
bly a corollary of the degree of attention 
given the "femaleness" pattern (Fig. 1) 
by most of the intensive studies of indi- 
vidual behavior. The percentage of “по 
evidence" ratings for the "maleness" pat- 
tern (Fig. 4), was practically double that 
for the “femaleness” pattern (Fig. 1). 

'The most difficult symbols to validate 
in our study were those purporting to 
deal with ‘‘maleness” (Fig. 4), "female- 
ness” (Fig. 1), competitive heterosexuality 
(Fig. 7), and sexual anxiety (Fig. 10). The 
highest figure in the "no evidence" col- 
umn on any symbol was 13.5%, and the 
lowest number of judgments on any 
symbol turned out to be 145. This is, of 
course, large enough to provide a relia- 
ble basis for conclusions. 

The reason that these patterns appear 
to refer to more obscure material than 
do the others could be found in the fact 
that with these patterns we enter the 
field of tabooed social values. The sym- 
bolism involved in the remaining pat- 
terns could hence be more easily vali- 
dated than could these four patterns. 
Nevertheless, it will be observed that sub- 
stantial confirmation of validity has been 
attained even for these patterns as the + 
and P+ figures in Table 5 clearly show. 
Incidentally, the remaining symbols con- 
tributed to the "no evidence" column in 
proportion to the number of judgments 
based upon them in the entire study. 


Consideration must be given to the 
“minus” column at this point. The only 
three symbols for which we find dispro- 
portionately higher percentages in this 
column than in the total column are the 
symbols for intimate heterosexuality 
(11.595),  "femaleness" (14.8%), and 
"maleness" (15.6%). In view of the fact 
that these three symbols have approxi- 
mately the same percentages in the “plus” 
column as in the total column, it would 
seem that the author might have felt 
encouraged to extend her conclusions be- 
yond the evidence available. The fact 
that these are respectively fourth, first, 
and second highest in number of inter- 
pretations derived from the completed 
patterns, however, tends to support the 
conclusion that, in spite of the figures in 
the "minus" column, these proved to be 
exceedingly productive and reliable pat 
terns. 

In considering the conflictual scoring 
of “plus-minus,” it is worth noting that 
the concentric circles pattern (Fig. 3) and 
the intimate heterosexual symbol (Fig. 6) 
comprise each 22% of the items thus 
categorized. The conflictual rating may 
have been due in these cases to confus- 
ing statements, leading to uncertain ы 
terpretations, by independent workers: 
or it may be that it is particularly easy n 
these two cases to emphasize an aspect 
of behavior brought out by the drawings 
and inquiry, without attributing suthi- 
cient importance to coexisting factors. 
Similar explanations are probably арр! 
priate with reference to the less markedly 
skewed figures for the aggression (Fig: A 
and the “maleness” (Fig. 4) symbols- T P3 
other figures in this colmn are not sign" 
ficant when compared to the base- po 
centages in the last column of Table 4: m 
most cases, there were no ratings in this 
category at all, showing that obtainable 


E 
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5 TABLE 5 


INTERPRETATIVE STATEMENTS ON EACH SYMBOL FALLING INTO THE VARIOUS 
SCORING GROUPS: TOTAL NUMBER, MALES 


No Complete dus Conflict- arnal Sus 980 
H orrobo- | Corrobo- ing ontra- otal umber 
Буш Data. Еу рее ration ration | Evidence diction | Number | Without 
"ү" SP" inc Mp 0 
OIE No. 12 8o 10 4 6 112 100 
Semi-circle % 10.0 80.0 10.0 4.0 6.0 
SE 4.0 3.0 2.0 2.4 
Semi-circle & No. 12 83 8 1 6 110 98 
Single li [^ 10.9 84.7 8.2 1.2 6.1 
Bene sÈ 3.0 2:8 io 214 
е No. 13 87 14 5 14 I 134 121 
Straight line % 9.7 71-9 11.6 4-1 11.6 8.0 
SE 4.0 о.о 1.8 9.0 0.9 
No. 20 7 xi 7 1 III 91 
Inverted "Vs" VA 18.0 79-1 12.1 7.7 fr 
SE a 3.0 2.8 1.0 
1 No. 26 8o 5 1 116 90 
Diffuse mass % 16.2 89.0 5.6 141 
SE 3.0 2.0 1.0 
No. 21 97 1 5 124 103 
Crosslines [A 16.9 94.2 o.9 4-9 
SE 2. 9.7 2.0 
É 6 74 I 4 109 93 
Counter-opposed | N° 3776 19:6 16:1 4-3 
semi-circles SE 4.0 3 Р 2.0 
Semi-circle No. 8 82 3 1 4 1 99 9t 
within larger [2 8.0 90.1 3-3 D. +3 BI 
semi-circle SE 3.0 1.7 ы 
“| 6 106 03 
No. 13 87 5 6. 
Jagged line [A 12.2 93.5 3:3 vr 
SE 2.4 1.7 
No. 18 106 a, E S 132 ay 
өй" у 8 ° of 
Incomplete "U & 13.5 9355 Yd 0.9 1.8 
5 3 124 107 
i 04 5 
Lines and semi- No. EU 87.8 4-7 4.6 2.8 
circles & š 3.0 2a 21 17 


evidence tended to be clear on the points 
raised in the experimental interpreta- 
tions, even though this evidence was 
sometimes not conclusive as in P+ and 
P— ratings. - | 

The data dealing with conflicting evi- 
dence and partial contradiction, involv- 
ing as they do together 34 of the 2,265 
interpretations, arc negligibly small and 
do not require detailed analysis. 


B. SEx FACTORS 


"Tables 5 and 6 summarize the results 
according to the percentage of interpre: 
tative statements for each symbol in each 
scoring group. Table 5 contains these re- 
sults for males only. Table 6 contains the 
results for females only. 


In Table 5, the only figure in the "plus" 
column that is below 79% is that for the 
“maleness” pattern (Fig. 4). Since this is 
one of the highest of the corresponding 
figures in Table 6, while that for the 
“femaleness” pattern (Fig. 1) is lower in 
the female group, it seems likely that the 
difficulty of separating the personality's 
identification with the parental role from 
his conception of that pattern causes 
some difficulty in analysis. The fact that 
many of these cases were analyzed with- 
out benefit of inquiry must be kept in 
mind, however, in interpreting this 

oint. In spite of this, the "maleness" 
pattern (Fig. 4) in the "plus" column of 
Table 5 is represented by 71.976 with a 
sigma of 2-496. In the “femaleness” pat- 
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tern (Fig. 1) row of Table 6, the datum 
in the "plus" column is 8195 with a 
sigma of +4%. These, then, are highly 
reliable results. In general, there seems 
to be greater variability in the correctness 
of interpretations in the male group than 
in the female group, where the percent- 
ages are neither as high nor as low as the 
extremes of the male group. Why this 
should be so is not clear. However, the 
effectiveness of the data for both sexes is 
so apparent that this slight variation 
should not matter. 

The fact that the percentages for the 
two sexes go down when relationships 
with the opposite sex as a group are con- 


sidered has-been discussed above. It is 
clear, in these tables, that evidence is less 
easy to find in these areas, although the 
relationships with those of the same sex- 
group are rather easily determined from 
biographical data. Thus, in both sexes, 
the percentage in the "plus" column go 
up for the same sex group, and those in 
the "no-evidence" and "P-plus" column 
rise when the opposite sex is considered 
as a group. Again, it is important to note 
that nowhere do the figures in the “plus” 
column become unsubstantial or insig- 
nificant. 

The only two relatively large figures 
in the "minus" column in both tables 


TABLE 6 


INTERPRETATIVE STATEMENTS ON EACH SYMBOL FALLING INTO THE VARIOUS 
SCORING GROUPS; TOTAL NUMBER, FEMALES 


Com- i t Р + Total 
Partial | Conflict- | Partial Partial Num- 
No plete [o H H Contra- Total ur 
= orrobo-| _ ing | Conflict- | 40 Contra- A gi 
Symbol | Data | Evidence Corrabo- | Cation” | evidence | jig, | AF | diction | сыт | without 
"чо" ^ 
* No. 3 81 5 14 103 100 
Semi-circle & 2.9 81.0 5.0 14.0 
Е 48 2.0 35 
т" No. 1 73 1 1 6 1 97 82 
Semi-circle & [A ex 89.0 1.0 1.0 7.3 1.0 
single line SE 3.5 1.0 1.0 9.0 1.0 
No. 12 85 2 1 5 2 107 95 
Straight line % 11.2 89.5 2.1 1.0 5.2 2.1 
SE 3.0 id 1.0 2.0 1.4 
No. 18 97 4 6 115 107 
Inverted Уз" % 14.4 90.6 3.7 5.6 
SE 2.6 157 2.0 
No. 9 7° 8 1 1 1 95 86 
Diffuse mass & 9-4 81.4 9-3 8.1 1.2 
4.1 3-4 3.0 1.0 
No. 22 61 6 1 1 1 92 70 
Crosslines ros 23.9 as 1 6.6 I.I т.т т.т 
+2 9-3 1.3 1.3 1.3 
No. 17 70 82 
Counter-opposed 7 5 т 6 99 
semi-circles a жее 854 3 H 1.1 6-9 
А fx 7 — 
Semi-circle No. 8 72 5 2 6 93 oe 
within larger % 8.6 84.7 5.8 2.3 7.0 
semi-circle SE 3.9 2.4 5.0 2.8 س‎ 
No. 12 62 8 2 4 88 8 
Jagged line & 13.3 81.6 10.2 2.5 Sex 
E 4-3 3-4 5.6 2.6 
— 
No. тз 14 7 s 88 
Incomplete "0" | % | 13-0 | вз 8.0 s Е: s 
3-7 2.8 8.4 1.0 
: р was 
А ; No. I6 64 5 74 
Lines and semi- $ 9o 
" % 17.7 86.5 6.5 
coles SE 4.2 218 ei 


j 
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е ТАВІЕ 7 


INTERPRETATIVE STATEMENTS FOR EACH SYMBOL FALLING INTO THE VARIOUS 
SconiNc Groups: TOTAL NUMBER OF ADULT PSYCHIATRIC CASES 


No Complete E 
Symbol Data Evidence Corroboration Gon rei 
ug" ep" 
T No 4 79 5 
Semi-circle % 45 94.0 б.о 
SE 3.6 2.7 
Т No. 9 58 5 
Semi-circle & single line % 12.5 92.0 7.0 
SE 3:4 3-4 
8 мо 6 at. 8 
traight line 0 7-1 9.0 1.0 
F SE 3.6 3.6 
No 6 64 4 
Inverted “Уз” %. 9.3 94.0 б.о 
SE 2.9 2.9 
No 8 55 7 
Diffuse mass & ast Е a ° 
No. 8c 67 3 
Crosslines & 10.2 95-0 4 3 
 — € 
No. 9 igh E 
Counter-opposed semi-circles & Ss cd 2.9 
Las А No. 4 so 5 
Semi-circle within larger semi- % 6.7 91.0 9.0 
circle $Е 3.8 3.8 
No. 7 Es A жЕ 
А 10.9 . к 
Jagged line & 2.5 | 2.5 
83 4 
е 3 
1 ' > 3.3 95.0 4.7 
ncomplete “U’ $Е 2.3 2.4 
бт 3 
то 
iif ye 13.5 95.0 4-7 
ines and semi-circles БЯ 2.8 2.8 
C. FAcroR or EMOTIONAL STABILITY 


Occur i š ith the symbol for š 
ur in connection with : 1 or Fig. Those who doubt the existence of gen- 


the par ex (Fi ө ^ 1 
4) ан of the same na it gives eralized (generic) symbols have occasion- 
. This is interesting ally pointed out that the pioneer work 


ў R : trend А 
Support to the previously noted tr iih symbols of this sort was dones 


о the discussion the ; por Nee researchers interested in pathology. For 
namely, that the confusion О med by this reason, ме include here Tables 7, 


Parental figure with the role assu Ay 

the individual who identifies with = 
Ж 

Parent tends to lower the accuracy o п 


terpretation. 


8, 9, and io. Table 7 refers to бо adult 
psychiatric cases, while Table 8 refers to 


po cases of apparently normal adults. 
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TABLE 8 


INTERPRETATIVE STATEMENTS FOR EACH SYMBOL FALLING INTO THE VARIOUS 
SCORING Groups: TOTAL NUMBER, NORMAL ADULTS 


e 


5% Compte Partial Goats: теш 
: orrobo- | Corrobo- ee a Total Number 
Symbol Data Бе ration ration diction | Number Without 
n PH” _ “Су” 
= No. 4 66 6 76 | 72 
Semi-circle % $ 92.o 8.3 
SE 3.0 3.0 | 
Simi 3 No. 6 44 1 sI | 45 
ТЕШ circle & single % 33:6] 97-2 3 5 
2. 25 
Мо. $ 48 2 2 5 2 
Straight line 7o. 8.8 92.3 3.8 3.8 Y a 
SE 3.4 2.4 2.4 
No 13 36 3 52 39 
Inverted “Vs” & 25.0 92.3 7.7 Ë 
° 4.2 4.2 
i No 9 43 I | 53 44 
Diffuse mass % 17.0 97-7 2.3 
SE 2.0 2.0 
— 
+ | 
я No. 8 44 52 44 
Crosslines KA 18.3 100.0 
Counter-opposed No. 7 rid 48 az 
semi-circles & 14.6 DUE 
Semi-circle within > 3 "US 56 53 
larger semi-circle [33 5-4 a 
n No. I 46 2 49 48 
Jagged line & 2.1 ق‎ 6 
2. 2, 
No. I SI 2 54 53 
Incomplete “U” £ 1.8 96.5 3:6 
2. 2. 
| ^ 
Lines and semi- | P 2 42 I 45 43 
circles E 24 97.5 2.5 
A 2.2 2.2 


The only difference between these two 
tables is that the "no evidence" column 
in the case of the normals is higher in 
some categories, and so are the percent- 
ages of “pluses.” 

Tables g and 10 summarize the same 
data pertaining to children. Table 9, re- 
ferring to children with behavior dis- 
orders, is not significantly different 


from Table 10 (normal children). The 
discrepancies here are easily explaine 
by the fact that normal group drawings 
were all analyzed without access to пе 
quiry. А 
In other words, the degree of relia 
bility was slightly higher in the normal 
group. This may be due to the fact that 
the attitudes of those in need of, or 
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TABLE 9 


INTERPRETATIVE STATEMENTS FOR EACH SYMBOL FALLING INTO THE VARIOUS 
SCORING GROUPS: TOTAL NuMBER, BEHAVIOR PROBLEM CHILDREN 


Partial 


Complete Conflict- 
`f z: - 
Si ol D E О Corrobo- Corrobo- ing Саа 
ae ata ашы ration ration Evidence un 
(0) DE “pn =" 
No. 23 š 1 
Semi-circle % 92.0 4.0 4.0 
SE 5.0 4.0 4.0 
Semi-ci tsi No. 3 Ir i 
line circle & single ГА 14.6 95.0 5.0 
SE 5.0 5.0 
No. 20 2 3 
Straight line % 80.0 8.0 12.0 
SE 4.5 0.5 $5 
No. I 22 2 
Inverted “Уз” % 4.0 о.о 9.0 
SE 6.0 6.0 
No. 3 20 I 
Diffuse mass % 13.0 95.0 is 
SE 5.0 5.9 
+: 
No. 4 21 
Crosslines %. 16.0 Tome 
SE 
Я 3 16 
Counter-opposed > 16.0 100.0 
semi-circles oh 
we D. . 1 21 
Semi-circle within > 5.0 100.0 
larger semi-circle SE 
j P й е 
agged line ° 9 £ 
Е SE 
18 T 
Né Р А 95.0 5.0 
Incomplete “U” £ He! 5.0 5.0 
І 
20 
Я М№о. З 95.0 5.0 
Lines and semi-circles KA 12. 5.0 $e 


undergoing, therapy are in a greater state 
of flux than are the attitudes of those 
who are relatively stable. The reason for 
the larger number of «О” scorings 1n the 
normal group is probably that half of the 
cases were from the interview group in 
which evidence was hardest t9 establish 
Оп some points. For example, sexual 


anxiety (Fig. 10) was sometimes 


to interpret in terms of concrete be- 
havior, and thus the necessary informa- 
tion (positive or negative) was compara- 
tively difficult to elicit. Substantially, 
these four tables do establish the fact that 
the symbols are about equally appropri- 
ate for the emotionally disturbed and the 
more adequately functioning members of 


a given population. 
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TABLE то E 


INTERPRETATIVE STATEMENTS FOR EACH SYMBOL FALLING INTO THE VARIOUS 
Ѕсовіхс GROUPS: TOTAL NUMBER, NORMAL CHILDREN 


Complete à Conflict- 
No Partial ; ¿et 
Symbol Data Edere Cormbo- —á Evidence Contindichom 
D nc 
ME No. 3 38 1 4 
Semi-circle 2o. 19.4 88.4 (13.9% P+) 2.3 9.3 (2.3% P=) 
E. SE 4.7 7.3 4.3 
кы No. 8 2 
Semi-circle & ¢ 9 3 a 
н ; % 19.4 85.4 (9.0% Р+) 8.8 `9 
single line $Е бо 9 es 3:9 
No. 8 | 36 4 
Straight line % 16.6 90.0 (15.0% P+) 10.0 (2.5% P—) 
SE 4.5 4.5 
Ко. 12 21 I 
Inverted “Уз” % 34.2 90.5 .| (18.1% P+) 4.5 
SE 1.0 1.0 
No. 2 28 I 2 5 
Diffuse mass 96 6.6 90.3 (10.0% P+) 3.2 6.5 (3.0% P—) 
SE | 5.2 0.3 1.0 
No. 9 20 I 3 
Crosslines % 27.3 83.3 (16.6% P+) 4.2 12.5 
SE S.S 4.0 6.7 
Counter-opposed Но. 15 38. (15.596 P+) m 
semi-circles Sh 35.5 2-0 5-5% д 
Semi-circle No. 5 29 2 5 
within larger % 12.6 80.6 | (10.0% P+) 5.5 13.9 (2.5% P—) 
semi-circle SE 6.5 | 255 $.2 a 
No. 3 27 I 3 
Jagged line %. 8.8 87.1 (22.0% P+) 3.2 9.7 
SE б.о 3.0 5.2 m 
No. 10 31 тї б E 
Incomplete “U” 26. 20.4 81.6 (15.0% P4-) 2.6 15. 
SE 6.2 Sun 
z 2.4 5-9 A 
; " No. 8 28 
L d semi- I 
сел a & 21.6 Pr (28.0% P+) 3.4 (3.096 P —) 
3-3 


D. Facror OF Асе 

Table 11 contains the results com- 
puted in percentage for the group of 46 
children, age range 6-16. Table 12 con- 
tains the results computed similarly for 
the adult group, age range 17-69. When 
these two tables are compared, the differ- 
ences appear very slight. It will be ob- 
served that percentages in the children's 


group have been influenced by the com- 
parative difficulty of obtaining judgment 
without direct inquiry on most cases- 
Table 13 summarizes the results for the 
18 children in the Indian groups. The 
size of the sigmas indicates a relative de- 
crease of validity. However, there is some 
interesting information to be gleaned 
from this table. In the first place, "® 
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TABLE Ir 


INTERPRETATIVE STATEMENTS FOR EACH SYMBOL FALLING INTO THE VARIOUS 
SCORING GROUPS: TOTAL NUMBER, CHILDREN 


o 


Nu Complete | Partial | Conflict- | Contra- Partial Total 
Symbol Data | Evidence | Corrobo- | Corrobo- ing dicti Contra- Total Number 
"О" ration |Evidence| MPP | diction | Number | Without 
pu" gen por nor 
is No. 3 55 7 H 4 1 71 68 
Semi-circle pA 4:2 89.1 10.3 1.4 5:9 1.4 
SE 4-8 3.6 1.4 2.9 1.4 
Semi-circle & No. п 43 з 3 5 63 52 
single line % 17.5 82.7 5-7 ET ET 
SE 2 3-2 3-2 3-2 
- No. 8 so 6 2 6 1 m 6s 
Straight line ГА 10.9 77.0 9.2 з 9.2 1.5 
SE 5.2 3.6 2.2 3-6 1.5 
No. 13 39 E 3 6 
Inverted "Уз" [4 22.0 84.7 8.7 6.5 29 st 
SE 5-3 4.2 3-7 
Y No. 5 45 3 т 2 1 57 2 
Diffuse mass % 8.8 79.0 5.3 1.7 КИ 1.7 5 
SE 5. Sat 2.0 2.6 2.0 
No. 13 27 1 3 48 35 
Crosslines ГА 22.0 77-1 11.4 2.9 8.6 
SE 7-1 5-3 3.0 4.6 
8 5 3 m 4 
Counter-opposed Lu dois stis 11.6 7-0 d 
semi-circles sk 5.9 4.7 3-8 
Semi-ci No. 6 47 4 ais 4 x 64 58 
wit ERR [4 9-4 81.0 6.9 3-4 6.9 1.7 
semi-circle SE 5.0 о.б o.6 o.6 0.5 
No. 5 до 7 1 3 56 51 
Jagged li [^ 8.9 78.5 13.7 1.9 5-0 
gged line & 5.6 4.7 o.6 Sk 
No. 14 43 6 1 7 71 57 
Inc өп" © М 75:5 10.6 1.7 12.3 
omplete "О & 10.7 d ER 594 3.3 
S 89 | 8 H I 110 99 
Lines and semi- No: ies 90:5 8.1 1.0 1.0 
circles SE 9.4 4.3 0.3 0.3 


children involved were five years younger 
at the time the validating material had 
been collected than they were at the time 
the test material Was administered. This 
represents a difference between the ages 
Of eight and thirteen. The greatest low- 
Cring of percentages in the "plus" col- 
umn occurred with reference to the 
following symbols: . 

(1) Individual heterosexual situa 

(2) Peer groups Р 

(3) Regression and mother ue 

(4) Self-concept 


tion 


` z itua- 

(5) Competitive heterosexual situ 
tions i. 

It seems evident that these are t 
expect 


areas in which one would logically 


much change over this period of five years. 

The reason for the relatively high pre- 
dictive value (85% "plus") of the sexual 
anxiety symbol may be the fact that the 
manner of handling, rather than a spe- 
cific source of anxiety, was emphasized 
in the analyses. This might not change 
very much with age. Of course, the other 
data do not shed light on this problem, 
since all that can be told from them is 
that the symbol is predictive at the vari- 


ous age levels also for members of our 


own cultural group. 
Although the sigmas are relatively 
large (due to the relatively small number 
of cases), in comparison with the data 
from the rest of this study (in which one 
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TABLE 12 


е 
INTERPRETATIVE STATEMENTS FOR EACH SYMBOL FALLING INTO THE VARIOUS 
SCORING Groups: ALL ADULTS 


| Partial ^ 
Complete | B Total 
- No _ | Evidence | Contra- ber 
Symbol Data | Evidence Con ono (Corrobo- diction nea PE 
O 
р" е? _ “o” 
No. 8 145 II 164 156 
Semi-circle % 12.8 93.0 7.0 
SE 2.0 2.0 
Lei No. 15 102 6 123 118 
Mae HUE A % 12.2 | 87.0 13.0 
g SE 3-0 
G == — 
No. II 118 2 10 141 130 
Straight line % 7.9 90.1 1.5 7.7 
SE 2.2 1.4 2.0 
No. .19 100 7 126 107 
Inverted “Vs” 96 15.0 93.3 6.7 
SE 2.0 = 
" No 17 98 8 123 106 
Diffuse mass 96 13.8 92.5 7.5 
ЅЕ ^ 2.1 
ч No. 16 тїї 3 130 114 
Crosslines NG 12.4 7-4 2.6 
SE t8 
M 8 
Counter-opposed op : v ra es PU e 
semi-circles SE ^ > 7 T 
108 
Semi-circle within yo gh p E 115 
larger semi-circle S 1:7 
. No. 8 104 | 4 116 108 
Jagged line %. 6.9 96.3 3.7 
SE 1.7 | 
No. 4 134 6 144 140 
Incomplete “U” % 2.8 96.7 Sed 
SE 1.6 
Lines and No. а 103 a 119 107 
semi-circles 0. Um 96.4 3.6 
SE r.6 | 
becomes accustomed to sigmas of one- they may be regarded as contributor? 
thirtieth of the proportion), those for the evidence, from the viewpoint of the ine 
stable areas are of good value in the In- as a whole. On the other hand, wher 
dian Group. They are about one-eighth the fin 


of the proportion, and thus indicate a 
better-than-chance distribution. 

Results on the twelve Indian children 
are probably inconclusive. On the other 
hand, since they indicate high reliability, 


these results are coupled with 
ings of anthropological studies such as 
Roheim's and Hallowell's, they may not 
necessarily prove the universality of y 
symbols selected, but the likelihood © 
their doing so is greatly enhanced. 


| 


P 


SYMBOL ELABORATION TEST 63 


E. STIMULUS SEQUENCE AS A FACTOR 


Approximately one-third of the sub- 
jects were given the symbols in random 
order. These results are summarized in 
Table 14. Comparing them with the 
results in Table 3, the “plus” percentages 
seem to be slightly higher. The explana- 
tion for this might easily lie in the elim- 
ination of thirty cases in which no in- 


quiry had been conducted. The order 
was the same for everyone in this par- 
ticular group (normal children). In any 
case, the results are not impressively 
higher in Table 11. The conclusion one 
can draw is that, even if the only reason 
for less than 100% plus-ratings for all 
symbols were a question of contamina- 
tion of attitudes, the 15% or less devia- 


TABLE 13 


INTERPRETATIVE STATEMENTS FOR Eac SYMBOL FALLING INTO THE VARIOUS 
SCORING GROUPS: INDIAN GROUP 


7 i I Омо Д Complete Conflicting m 
Symbol Data | Evidence Corroboration Evidence Contradiction 
“O” un ur 
No. 2 9 2 
Semi-circle % 15.4 81.8 (54.5% P+) 18.1 
SE 11.3 11.3 
- No. o 8 4 
Semi-circle & DA © 66.6 (8.3% Pr) 33.3 
single line SE 13.5 13.5 
No. 4 12 3 
Straight line š 8o. 20.0 (13.3% P—) 
Straight line & 21.1 0 ze 37% 
No. 6 її 2 
Inverted “Vs” % 30.0 85.7 (21-4% P+) 14.2 ( 7.1% P—) 
SE 10.0 10.0 
No. 3 x T 
i Р © 
Diffuse mass & 21.4 29 (45.5% P+) 8.4 
3 
No. 7 7 = 
. s% P 30.0 (27.3% P—) 
Crosslines & 48.9 е (45.5% P+) Ө 
No. 5 5 i % P. 7 
Counter-opposed % 37.5 62.5 (40-0% P+) | 12.5 (20.0% P) | 25.0 
semi-circles S 14.8 10.0 
3 
Semi-circ No. 3 7 А ° 
within lager DA arg | 63-6 (27.376 P+) 95 “Ж 
semi-circle SE ХБ 
Ж : š 
No. 2 8 (37.s% P+) 11.1 (12.5% P—) 
‚ Jagged line % 20.0 € (37-576 | 1o: 
PEN A Я 
No. 5 9 (23.105 P+) | 21-4 (15.5% P) 14.2 ( 7.7% P—) 
Incomplete “uy” % 27.8 tae 3 aa 9.0 
| | 6 
= 1I ur 
Lines and No. РЕГ 64.7 (11.8% Р+) 35-3 (11.8% P ) 
Semi-circles & > 11.6 | 
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TABLE 14 6 


INTERPRETATIVE STATEMENTS FoR EACH SYMBOL FALLING INTO THE VARIOUS 
SCORING GROUPS: RANDOM ORDER Group 


No Complete) dua Conflict- Саша. sil 
zv: orrobo- orrobo- ing а ا‎ Total Number 
Буы Date | Evidence: | Sagon ration | Evidence | diction | Number | Without 
RUE p" gee = tor 

No. 3 62 I 70 
Semi-circle % TUS 88.6 I.4 xis "s 

SE 3.8 1:2 3.6 
Semi-circle & No. 5 49 3 57 52 

5 H % 8.9 94.3 т 

single line sk qc $ " 

No. 5 52 
Straight line KA 8.9 91.2 1.8 Ta 8 d 

SE 3.8 3.4 3.4 

No. 8 50 1 62 4 
Inverted “Уз” % 12.9 92.5 1.9 зв Š 

SE 3.6 1.5 38 

No. 4 48 1 53 
Diffuse Mass % 7.7 90.8 1.5 vs M 

SE 3.9 1.6 34 

А Мо. 8 51 1 бо 5а 

Crosslines VA tz 98.1 1.9 

SE 1.9 1.9 n 
Counter-opposed No. 49 Ы £ 59 5? 
semi-circles" di 22.8 94:2 1:9 3:9 

SE 3.2 1.9 0.7 

7 

Semi-circle within | Мо. 4 46, 3 53 49 
larger semi-circle sk 7.5 93. б.2 
. р 3.3 3.3 

No. 2 4 2 so 
Jagged line % 3.8 9416 бе Ў 

SE 3.3 3:3 

No. 1 63 64 63 
Incomplete "U" & 1.5 тоо 
Lines and semi- No. 3 НЕТ 4 50 56 
circles % 550 92. $42 

SE 3-4 3-4 


tion from 100% may be attributed to the 
significance of the order in which the 
stimulus-patterns were presented. How- 
ever, there is good reason to believe, as 
has already been demonstrated, that such 
factors as lack of sufficient inquiry and 
lack of inclusiveness of validating data 
affecting Table 3, all have a negative ef- 
fect on the "plus" percentages. Thus one 
is left with the impression that the sym- 
bols included in this investigation are 


sufficiently unique in individual meaning: 
and effective as individual stimuli, to re- 
duce the sequence of their presentation 
to a position of relative unimportance- 
We can thus conclude that the order of 
presentation of stimuli—no less than the 
factors of sex, emotional stability, 48€ 
and cultural background—has no apprec* 
able effect on the results obtained by our 
technique. 


CHAPTER V 


CONCLUSIONS 


A. GENERAL IMPLICATIONS 

or Tuis STUDY 
T hypothesis tested in this study was 
that certain abstract visual forms 
may be regarded as universal symbols. 
Eleven forms were selected for use as 
stimulus-patterns. The reactions of the 
subjects to these patterns, expressed on 
both a non-verbal and а verbal level, 
were analyzed in terms of the hypotheti- 
cal meanings of the stimulus-patterns. 
Validation of the meanings of the stimu- 
lus-patterns depended upon whether 
these interpretations about the subjects 

could be established as true. 

The results described in detail in the 
preceding chapter show that the eleven 
Stimulus-patterns all disclosed high pre- 
dictability as cues to emotional dynamics. 
We may thus conclude that these stimu- 
lus-patterns could be reliably interpreted 
on the assumption that they contain in- 
herently those subtle symbolic meanings 
which we attributed to them im our 
basic, pre-experimental rationale. 

The results are similar for both emo- 
tionally stable and emotionally unstable 
members of our culture, regardless of sex 
or age. They are less conclusive, because 
Of the small sample used, for members of 
Indian cultures. In the Indian groups, 
too, however, the evidence supports the 
hypothesis under question. Further work 
With members of other cultures 1$ there- 


fore warranted. 


B. Use or THE SYMBOLS 

AS Basis FoR PROJECTIVE TECHNIQUE 
shi he 

In discussing the use to which th 


š i ive test, 
Symbols may be put in а projective š 
it is necessary first to consider the mea 
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ing of projection in this sense. As under- 
stood by this writer, projection refers to 
the qualities which a situation assumes 
as a result of the dynamics which the sub- 
ject brings to bear on it. The more 
plastic a situation is, the more unstruc- 
tured it is, the more of his own qualities 
may we expect a subject to reveal in cop- 
ing with the situation. However, a sub- 
ject’s "projections" may be observed in 
all life situations. In using a projective 
technique, we standardize a set of condi- 
tions which can be presented to each new 
subject, so that his reactions in this lab- 
oratory situation may be compared to the 
reaccions of others. Even in so thoroughly 
structured a situation as one in which a 
soldier is required to give a salute, we may 
see this. Some men respond snappily and 
with pride in the act. Others respond so 
quickly that the salute is barely meaning- 
ful. Still others: are slow and deliberate. 
In each case the individual shows his own 
style of reaction and, thus, certain aspects 
of his personality. 

The Rorschach test is possibly the least 
structured and the most plastic of the 
various projective techniques in common 
use. However, the manner of the subject's 
response has been more thoroughly stand- 
ardized than the meanings of the situa- 
tions to which he responds. Other tests, 
such as the T AT, present somewhat more 
structured situations to the subject. How- 
ever, the clearer definition of the stimu- 
lus-material tends to limit the extent of 
individual projection. The Symbol Elab- 
oration Test, as we might call the appli- 
cation of the stimulus-patterns described 
herein, offers an advantage in that the 
situation presented to the subject is dis- 


66 


tinctly structuralized for the informed ob- 
server (clinical psychologist) while, for 
the subject, there is a minimum of con- 
ventional meaning to guide his responses. 
A further point in its favor is the fact 
that this test does not depend upon the 
subject's selection of certain elements of 
the field for his interpretation. In most 
techniques, it is sometimes a question of 
whether or not avoidance of a part of the 
stimulus is responsible for emphasis upon 
another part. Each of the symbols here 
presents a stimulus situation on an all-or- 
none basis. Thus, it is always clear to 
what situation specifically the subject re- 
acts. Finally, the non-verbal character of 
the completion required in this test, pre- 
vents blockage in many subjects who 
might be inhibited on a verbal test hav- 
ing similar objectives, but who give’ no 
evidence of inhibition here. 

Also important is the relative ease of 
interpretation made possible by the test- 
er's previous knowledge of the implica- 
tions of the symbols in this test. It is true 
that experience in free-drawing analysis 
is very helpful in interpreting the sym- 
bol-completions offered by the subject. 
However, even experience in handling 
symbolic material is not essential to the 
profitable use of the test proposed, so 
long as the symbols are accepted as carry- 
ing certain non-conscious implications. 
The inquiry-questions provide guidance 
in the interpretation of the drawings, 
such that the affectual tone of the subject 
toward a given symbol can be revealed. 
In every case there is some clue as to the 
subject’s feeling toward the area of ex- 
perience symbolized. Almost never does 
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a subject totally refuse to draw upon a 
sheet; and, when he does, it may be dis- 
covered, or even assumed, that his con- 
flict is with regard to the particular area 
symbolized on the sheet. The influence 
of the order of presentation of the sym- 


bols on the reactions of subjects has been. 


proved elsewhere to be negligible. 

The foregoing is not meant to imply 
that this test is expected to supplant the 
projective techniques already їп use. 
Rather, it is felt that to propose the em- 
ployment of a new test is to accept re- 
sponsibility for justifying its addition to 
a standard battery. Anyone familiar with 
the various projective techniques now 
utilized in studying personality knows 
that each of them has a value of its own. 

One important limitation to the use 
of this new test lies in the fact that only 
one main response can be given to each 
symbol in a single administration. The 
Rorschach test, the TAT, and others pro- 
vide an opportunity for as many re- 
sponses to a single stimulus as the sub- 
ject wishes to make. Also, the fact that 
the symbols are centered on the page 
limits to an extent the applicability of 
the free-drawing type of analysis to this 
material. Another limitation lies in the 
effect of a strong dislike for drawing on 
the nature of the task. In such cases, the 
analysis depends solely on answers to the 
inquiry questions expressing the subjects’ 
general attitude toward the stimulus ma- 
terial. Such an analysis is found to be 
schematic. Nonetheless, it is hoped that 
there will be further use for the symbols 
in clinical as well as research work. 
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CHANGES IN RESPONSES TO THE MINNESOTA MULTIPHASIC 
INVENTORY FOLLOWING CERTAIN THERAPIES! 


CHAPTER I 


INTRODUCTION 


HREE INFLUENCES stimulated the 
— study: (a) the intensive de- 
mand for тоге and better psychiatric 
services, (b) the attempts to formulate 
new systems of psychotherapy, and (c) 
the very recent but increasing efforts to 
establish objective criteria for the evalua- 
tion of therapeutic endeavors. Specifi- 
cally, this study is concerned with an ex- 
ploration of certain of the theoretical 


and practical problems involved in the 
measurement by means of a structured 
Personality inventory of therapeutically 
induced changes in personality. 
Psychiatry is currently experiencing a period 
of intense activity which is in direct response to 
the demands for therapy of the numberless 
Personality disorders of all degrees engendered 
in civilian and veteran populations alike by the 
emotional stresses of World War II (1). With the 
addition of the personal maladjustments Orig- 
inating under the pressure of the merna 
years to the pre-war case load, it is natural i 
the interest and contributions of the و‎ 
and the public should be in the direction o 
increasing the number and quality of centen 
for the training of more clinicians. The approxi- 
atrists and 800 clinical psychol- 
he rolls of their respective 
Organizations cannot, in point of numbers, em 
Че in anything approaching adequate fas es 
the prophylactic and therapeutic needs of е 
Population, Estimates have placed the number 
Re Psychiatrists required jn certain areas anon 
OUr to seven times the number now available 


(5). It was primarily to remedy the posed 
adequately trained personnel that the ome 
Tental Health Act was passed in 1949- ка 
„€ provisions of this act, the United SS TH 
lic Health service is directing and subsidizing 

i mitted to 
Е thesis su iversity 
nt of the re- 
tor of Philos- 


mately 4000 psychi 
Ogists now listed on t 
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an extensive program to improve training facili- 
ties in psychiatry, clinical psychology, and psy- 
chiatric social work and to increase the number 
of trainees in these fields. 

It is necessary, though difficult, in the midst of 
practical considerations and the exigencies of 
daily service to attempt a long view. Some work- 
ers already have expressed doubt as to the ulti- 
mate validity of the present emphasis on train- 
ing. Kubie suggests that there is a fallacy in the 
theory that the training of more workers will 
suffice to improve the psychiatric health of the 
community or nation, at least as measured in 
terms of the number of patients (12). He points 
out shat increased availability of services invar- 
iably leads to increased demand, so that the 
"catching up" point is never reached. An ob- 
viously crucial factor in determining the pa- 
tients-per-therapist ratio is the effectiveness of 
the remedies available to the clinician, Not only 
must more workers be trained, but their training 
and therapeutic armamentarium must be en- 
hanced by new techniques of proven efficacy. 
The development and testing of new techniques 
of therapy demand a research orientation, and 
research programs must not be grossly subor- 
dinated to the construction of curricula if psy- 
chiatry is to derive maximum benefit from the 
advantage it now enjoys under the triple in- 
centives of social need, favorable public opinion, 
and government funds. ^ 

It is fortunate that the effective relationships 
which were established between psychiatry and 
clinical psychology in the armed services show 

romise of being maintained. Clinical psychol- 
ogy also is benefiting from government subsidi- 
zation and while this support is, again, primarily 
to increase the number of trainees in this field, 
it may be hoped that a direct impetus to research 


will result. 3 
It is only through the medium of research, 


supported and conducted by all professional 
workers concerned with the treatment and pre- 
vention of psychiatric disorders, that we may 
hope to achieve the construction and sharpening 
of those diagnostic, prognostic, and evaluative 
instruments which are necessary to assure the 
most efficient application of such facilities as 
are available for the care of personality dis- 
turbances at all levels of their manifestations. 
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What are the most pressing research 
needs in the field of mental health from 
a long-term view? First of all, there is a 
need for research which will establish in 
a definitive manner the level of effective- 
ness of existent therapies with respect to 
specifically diagnosed disorders. Sec- 
ondly, there is a need for research which 
will establish the validity of newly pro- 
posed therapies. Inherent in researches 
of this type is the specific problem of 
determining which patients improve, 
and how much they improve, and which 
patients do not improve under a given 
regimen. The solution of this problem is 
contingent upon the attainment of a 
standard, objective, representative evalu- 
ation of the status of an individual's per- 
sonality. 

‘This paper presents an investigaticn of 
some of the problems arising in the 
application and interpretation of such 
evaluations, particularly as they furnish 
data on the effectiveness of given thera- 
pies. 

'The histories of the major psychiatric 
therapies have followed a certain pat- 
tern. In brief, there has been an initial 
enthusiastic announcement of the new 
technique accompanied with or followed 
shortly by figures on recovery which have 
represented not merely a sizeable im- 
provement over those for earlier thera- 
pies but which have approached the 100 
per cent point so closely as to suggest a 
specific cure. Later reports on the tech- 
nique from other clinics and sometimes 
from the clinic of origin have shown con- 
siderably lower frequencies of recovery. 
Thus, Sakel in first publishing on insulin 
shock therapy of schizophrenia claimed 
87.9 per cent remissions. This figure has 
not been duplicated by any other work- 
ers. 

There are undoubtedly many factors 
involved in the explanation of diver- 


gences between comparable figures for 
recovery ratés in two clinics as well as for 
the failure of originally very high rates 
of remission in a given clinic to be main- 
tained in further studies. A considerable 
part of divergence in interclinic statistics 
is probably due to differences in the sys- 
tems (or lack of system) by which the 
status of patients is evaluated both be- 
fore and after a course of therapy. Most 
commonly such evaluations are char- 
acterized by subjectivity, with little con- 
cern that the evaluation of the patient 
before and after therapy be on the same 
schedule of variables (3). In extreme 
form, this type of evaluation of a pa 
tient’s progress is expressed in terms of 
the facts that his diagnosis at time of ad- 
mission to the hospital was "'schizo- 
phrenia—paranoid type" and that he was 
noted at time of discharge to be frec of 
hallucinations. There would be many 
observations and facts concerning the 
patient which went "into" the diagnosis 
and yet which, in the form of a diagnosis, 
would not be available as a basis for 
evaluating his status when discharged. 
Likewise, there could be many observa 
tions concerning the patient at the time 
of discharge which would be relevant to 
predictions of his adjustment at home 
and at work and yet which would not be 
properly weighed and communicated by 
recording only that his hallucinatory be 
havior had ceased. Evaluations of this 
gross type permit recognition of the most 
obvious degrees of improvement, ie 
abatement of severe symptomatology; but 
they are not sufficient for detection p 
improvement in the patient's personality 
with respect to those inadequacies in 50° 
cial adjustments and diurnal habits 
which served as the first clues to his ill- 
ness. From this point of view, the max" 
mal efficacy of a given therapy for 2 
given patient could not be ascertained at 
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the time of his discharge from the hos- 
pital but only after he had Teturned to a 
normal, non-institutional environment 
and only in terms of the quality of his 
adjustments in that environment. Fur- 
thermore, the observations and evalua- 
tions of his post-therapy behavior would 
have to be ordered to the same method 
of recording as was used to depict his 
Status prior to therapy. Limitations of 
time and money have generally pro- 
hibited the attainment of this ideal para- 
digm for the validation of a therapeutic 
procedure. The next best approach de- 
mands that the status of the individual 
be thoroughly measured in terms of such 
observations as are possible immediately 
prior to the start of a course of therapy 
and immediately upon the completion of 
that course. It is equally important in 
this more circumscribed evaluation of 
the patient's progress that his status be- 
fore and after therapy he measured with 
respect to the same variables. 

It may be argued that no matter how 
rigorous the procedure of investigation, 
the “pure” effects of shock therapy, for 
example, may not be determined since 
such therapy is always imbedded in a 
certain setting and adminstered by per- 
sonnel whose effects on the patients can- 
not be held constant. It may further be 
argued that the effects assignable to 
shock are inextricably mixed with the 
effects of the less dramatic psychothera- 
peutic efforts of the physician. With re- 
gard to this last point, it must be recog- 
nized that the ancillary therapeutic pro- 
grams developed for the patient receiv- 
ing convulsive therapy are probably as 
diverse as the theories proposed to ex- 
plain the mechanics (or dynamics) of 
shock as therapy. This source of dis- 
crepancy should be remembered when 
it is necessary to explain different meas- 


ured effects in different clinics when the 


same (?) therapy has been applied. 

With regard to the complicating effects 
of other psychotherapeutic efforts (e.g., 
interviews) accompanying a major medi- 
cal therapy, it should be recognized that 
the clinic in which the decision to apply 
a convulsive therapy is not accompanied 
by a reduction (either planned or un- 
planned) in the number of hours spent 
by the physician in personal contacts 
with the patient is probably rare. When 
a gross procedure of the nature of electro- 
shock seems a promising approach with 
a given patient, the therapist probably 
feels some justification in reducing his 
contacts with that patient so as to af- 
ford more hours to those patients for 
whom no "specific" therapy is available. 
In short, the effects of a therapy such as 
eleatroshock are probably open to evalu- 
ation in a more pure form than would at 
first appear to be the case. 

The design and carrying through of 
investigations of the above nature must 
necessarily await the development of ade- 
quate techniques for the thorough de- 
scriptive assay of the individual person- 
ality. It has been only within the last 
decade that clinical psychology has of- 
fered an instrument having the essential 
virtues of psychiatrically oriented vari- 
ables empirically validated and quantita- 
tively expressed, This instrument is the 
Minnesota Multiphasic Personality In- 
ventory (8, 9). This psychiatric inventory 
was designed primarily to furnish an ex- 
ternal, objective source of information 
bearing on the detection and diagnosis 
of personality aberrations. In addition to 
four scales which indicate the validity of 
the subject’s record, the MMPI measures 
personality deviations on nine clinical 
scales: Hs—hypochondriasis, D—depres- 


2 The Minnesota Multiphasic Personality In- 
ventory will be referred to hereafter as the 


MMPI. 
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sion, Hy—hysteria, Pd—psychopathic de- 
viate, Mf—masculinity-femininity, Pa— 
paranoia, Pt—psychasthenia, Sc—schizo- 
phrenia, and Ma—hypomania. 

The validity measures are the “Cannot 
Say,” L, F, and К scales. The first of these 
is simply the number of items placed in 
the “Cannot Say” category. Since the 
“Cannot Say” responses are not scored 
on any of the other scales, a large num- 
ber of such responses has a generally sup- 
pressing effect on the clinical profile. 
The L scale is composed of fifteen items 
designed to measure the strength of the 
subject's tendency to "fake good" by 
selecting responses which place him in a 
socially acceptable light. The F scale is 
composed of items the scored responses 
to which are extremely infrequent; cle- 
vated F-scores may occur as a resu't of 
errors in scoring, carelessness or lack of 
comprehension by the subject, or a “fake 
bad" tendency in the subject. The K 
scale is the most recent of the validity 
scales and provides a measure of the de- 
gree to which "defensiveness" has char- 


acterized the subject's test-taking atti- 
tude. 

The wide use of the MMPI by psychia- 
trists, clinics, and general physicians at- 
tests to its success in a diagnostic role (2; 
15). It is interesting to note that only 
recently has there been a sign of appre- 
ciation of its potential value in measur- 
ing the effects of a given course of ther- 
apy (7, 15, 16). That there should be such 
a lag in the use of a basically diagnostic 
instrument for the evaluation of change 
reflects a gencral failure on the part of 
psychiatrists and psychologists to recog 
nize that the determination of the pa 
tient’s condition at any time after hos- 
pitalization or the onset of therapeutic 
interviews cannot be usefully made €x- 
cept on the very same variables the assess 
ment of which served as the basis for 
hospitalization or the onset of therapy: 
The clinician should be no less willing to 
accept supportive information for his 
post-therapy evaluation than he was to 
seek assistance in the original problem of 
diagnosis. 


CHAPTER П | 


PURPOSE, SUBJECTS AND PROCEDURES 


T1 MAJOR ASSUMPTION underlying 
the investigation reported here is 
that an instrument of demonstrated 
validity in the measurement of personal- 
ity deviations of psychiatric import is of 
equa] validity for the evaluation of per- 
sonality changes and of the therapeutic 
operations deemed responsible for those 
changes. It is not assumed, however, that 
a direct evaluation of change in per- 
sonality, or of the efficacy of a therapy, 
is possible through the single study of 
score changes. Rather, it is believed nec- 
essary to determine whether there are 
patterns of change in scores on a multi- 
variable instrument which are character- 
istic for given patients under given 
regimens and to discriminate, on as 
objective a basis as possible, between 
those changes in the personality assay 
which are therapy-related and those 
which are merely chance. Further, it is 
believed necessary to evaluate the extent 
of the relationship between changes in 


item and associated score 


changes. 
In the event that an individual com- 


pletes a personality questionnaire such 
as the MMPI on two different occasions, 
certain analyses may be made of any dif- 
ferences which are observed in the result- 
ing profiles, It may be asked if, the two 
profiles differ in a degree which would 
appear to approach significance. More 
Specifically, this question relates to 
Whether the two scores obtained on each 
single scale of the profiles differ by such 
ап amount as exceeds that for which the 
hypothesis of no true difference (be- 
tween the means of hypothetical popula- 


tions of first-profile and second-profile 


response 


ex 


scores) is tenable. For simplicity in refer- 
ence, this is the problem of status-score 
change. The observed differences be- 
tween the various pairs of status-scores 
constitute change-scores. 

There is still another kind of change 
not yet extensively considered, perhaps 
because attention to status-score changes 
has tended to mask the operation of this 
more subtle, but potentially significant 
variation from profile to profile. This is 
the change in response to specific items 
composing the various scales; it may be 
referred to as item-response change. 

The purpose of this investigation has 
been to study the item-responses on the 
MMPI of certain groups of persons and 
the changes in the item-responses of 
these persons as a result of the passage 
of time plus the occurrence of certain 
events. More specifically, it has been de- 
signed to study the nature of changes 
in the responses to the MMPI items of 
certain groups of psychiatric patients 
and the relationships between item-re- 
sponse changes and status-score changes 
in these groups, and it has been oriented 
toward the problem of detecting and 
measuring therapyinduced changes in 
personality. 

The investigation began with a careful 
check of the approximately 7000 records 
in the MMPI research files at the Uni- 
versity of Minnesota to obtain the names 
of all persons, normals and patients, to 
whom the inventory had been adminis- 
tered on at least two occasions. For all 
Cases, available information on educa- 
tion, occupation, marital status, place of 
residence, etc. was recorded in addition 
to the interval, to the nearest 5-day unit, 
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between the two administrations of the 
MMPI. No in-patients were included in 
the study unless a check of their hospital 
record revealed that they had been dis- 
charged "improved" or "recovered" and 
that the two MMPI profiles had been 
obtained within the limits of a single 
hospitalization, and the length of this 
hospitalization was recorded in addition 
to the above information.’ The hospital 
charts of the patients were also checked 
carefully to determine the exact nature 
of the therapy which the patient had re- 
ceived, and particular note was made of 
the use of electroshock. 

A postadmission MMPI is obtained 
routinely on all patients admitted to the 
Psychopathic Unit of the University of 
Minnesota Hospitals. With the begin- 
ning of this study in September, 1946, a 
special effort was made to collect post- 
therapy, pre-discharge MMPI records on 
all patients. Most of the patients com- 
prising the “psychotic-shock” group for 
this study were obtained in this manner. 

The Psychopathic Unit has a total of 
only 35 beds of which six are in a sepa- 
rate ward for grossly disturbed patients. 
While the number of beds available to 
men and women patients on the general 
psychiatric ward are approximately equal, 
the actual number of women hospitalized 
per year greatly exceeds the number of 
men.? Due to lack of an adequate num- 
ber of male patients for whom two rec- 
ords are available, the present study 
deals entirely with females. 

The normal cases fall into two groups: 
the first is composed of nineteen female 
night-school students, half of whom took 

!In the case of the out-patients included in 
this study, the number of visits to the Out- 
patient Psychiatric Clinic between the two 
MMPIs was recorded. No evaluation of “im- 
provement” in these patients was available. 


* In 1947, the Psychopathic Unit treated a total 
of 180 female and 123 male in-patients. 
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the individual form of the MMPI first 
and one week later took the group form, 
the other half taking the two forms in 
reversed order with a one-week interval 
between them. The second group is 
composed of 23 females selected from the 
files because they had two MMPI rec 
ords. The mean interval between the two 
tests for this second group was 136.4 
days; the median interval was 47.5 days. 
This indicates the marked positive 
skewness which characterized the distri- 
bution of inter-test intervals, The mean 
age of the first group was 32.1 years, 
with a standard deviation of 7.9; the 
mean age of the second group was 26.3 
years, with a standard deviation of 4.4: 
The difference in the mean ages of the 
two groups is not significant (t = 998) 
nor is the difference in the variances sig- 
nificant (F = 2.727). The MMPI test 
and retest means and standard deviations 
for the two normal groups are given in 
Table 1. Only one of the test-retest dif- 
ferences in mean score is statistically sig- 
nificant; this difference occurs on the Pt 
scale in the nightschool sample (t= 
3.95; Р < 01) The test-retest corrcla- 
tion for the Pt scores in this group is 
.783, the highest of such correlations in 
this group for any of the clinical scales. 
If we may assume, on the basis of the 
very marked tendency in both groups of 
normals for the second test means on the 
clinical scales to be lower than the orig: 
inal means and on the basis of similar 
findings in other studies which have indi- 
cated a tendency for the "adjustment 
scores" of normals to change in the direc- 
tion of improvement, that there exists an 

> These cases were originally included in а 
study to determine the equality of thc group 
and individual forms. No significant differences 
were found between the mean scale scores О” 
tained with the two forms, The writer is iP- 


debted to Dr. Howard P. Longstaff for the data 
from these cases. 
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TABLE 1 
© 
Test AND RETEST MEANS AND STANDARD DEVIATIONS OF THE MMPI Scorts or THE 


Two Groups 


or NorMALs* 


? L F K Hs D 
Group Test 
M с м c M c M c M c M c 
Night- 1 12.6 9.7 3.2 2.2 3.4 3.1 55.2 8.6 48.0 5.4 51-4 8.8 
school 2 17.3 19.0 20% 252 2.6 3.2 56.3 8.6 46.0 5.7 49.0 7-3 
File- 1 20.4 17.8 $.4 423 3-0 .0 57.9 7.8 46.3 7.8 47.0 10.0 
random 2 21.9 20.4 3.0 2.0 2:4 1.8 59.0 7.8 45.4 5.7 45-3 7 
Hy Pd Pa Pt Sc Ma 
Night- £ 52.1 6.8 54.0 8.8 53.8 10.2 $2.т 6.6 53.1 7-5 51.4 8.2 
school 2 50.9 7.8 52.9 9.2 50.6 9.3 48.7 6.8 51.5 8.4 49.0 8.7 
File- I 50.4 8.6 49.6 7.1 50.2 8.6 47.8 7.2 52.2 7.8 48.4 9.4 
random 2 48.2 8.4 49.2 8.5 51.0 7.0 47.2 7.8 50.7 6.2 49.2 10.8 
ы The MMPI data in this and all subsequent tables are based on T-scores derived from the usual 
formula: 
X-M 
T=50+10——— 
с 


except for the data for the “?,” L, and F scales which arg in raw score units. 


as yet unanalyzed factor making for a 
regression toward the norm in the per- 
sonality measure of normals, even in the 
absence of originally extreme scores, 
then the appearance of such a factor 
would be maximized in the test-retest 
differences of those scales having the 
greatest reliability. This explanation of 
the change on the Pt scale shown by the 


night-school students is offered in the 
absence of evidence of any more obvious 
factor which might account for the ob- 
served difference. It will be noted from 
Table 2, which presents a summary of 
descriptive statistics for the normal and 
clinical groups, that the combined nor- 
mal group is characterized by a high in- 
cidence of individuals with high school 


TABLE 2 
GENERAL DESCRIPTIVE STATISTICS FOR THE Five SAMPLES TESTED 


Days between 


Age Days in Hospital? MMPIs % % 
Group* Number Single H.S. 
Ë Mean Range Mean Range Mean Range 

—46 61.6 2-1060 78.6 95.0 

Normals 42 35 AE 5.8 2-18 80.6 5-330 32.0 63.2 
PN 25 260 17-30 $2.0 20-114 33-4 5-75 41.7 34.8 
PS.NS 24 8 те 64.4 15-106 33.8 5-65 38.5 30.8 
PSs E FE 20-59 50.6 23-90 27.2 5-40 30.8 83.3 

^ ОРр—25 out-patients. " 
PN—24 hospitalized neurotics, сз who received neither insulin nor electroshock therapy. 


PS-NS—13 hospitalized psychoti се 
PS-S—20 Hospitalized psychotics treated w 
b The figures in this column for the OPD group 
the two MMPIs. E 
* Percent of group with high school 


electroshock. 


fer to number of clinic visits in the interval between 


education or better. 
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education or better, from metropolitan 
areas. 

The clinical groups are four in num- 
ber: 25 out-patients; 24 hospitalized neu- 
rotics; 13 hospitalized psychotics who did 
not receive any type of convulsive thera- 
py; and 20 psychotics who received elec- 


TABLE 3 


Tue COMPOSITION or THE CLINICAL GROUPS 
BY DIAGNOSIS 


Num- 


Diagnosis Бе 


Group 


OPD 


Psychoneurosis (mostly mixed). .| 22 
N=25 ۴ 


Schizophrenia, simple 
Hypomania... 
Undiagnosed. . 


PN Psychoneurosis, mixed. . . 
N=24 hysteria.. а 
hypochondriasis 
psychasthenia . . 
reactive depres- 
BION os aa e s< “4 


PS-NS 


Manic-depressive, depressed... . 
N=13 i E 


manic... 

circular 

Agitated depression 
Schizophrenia, simple. 
catatonic 

Recurrent depression. ... 
Psychosis, undiagnosed 


Manic-depressive, depressed... . 
Involutional, melancholia... Y 
paranoia... 
Schizophrenia, paranoid. . 
hebephrenia . 
undetermined.... 

Paranoid condition. ........... 


troshock therapy. Table 3 presents the 
composition of the clinical groups by 
diagnosis. 

The two MMPI records of each sub- 
ject were tallied on a common form in* 
such a way as to indicate the actual ге- 
sponse (‘“True,” "False," "Cannot Say") of 
the subject to each item on each of the two 
testings. From these combined records, it 
was possible to determine for each indi- 
vidual the total number of changed re- 
sponses, the number of changes by direc- 
tion of change, and the number of 
changes per general type of item. These 
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data for the individuals were then conr 
bined to permit determination of group 
statistics. 

The mean score of each group on each 
of the validity and clinical scales of the 
MMPI was determined for the first and 
second tests and the reliability of the 
differences between the test and retest 
means of each group was determined 
by statistical analysis. 

The intergroup differences in the 
means and variances for the number of 
item-responses changed per individual 
were tested for reliability as were the 
inter-group differences in means and 
variances for the number of responses 
changed per item. 

In order to study the relationship of 
item-response changes to changes in sta- 
tus-scores, the total change-score, i.e. 
change in status-scores, for cach individ- 
ual was determined by adding, without 
regard to sign, the differences between 
his test and retest scores on the clinical 
variables. The ratio of this total changc- 
score (Xcs) to the number of items 
changed (Xic) was determined [or each 
individual and summary statistics for 
this ratio (Xcs/Xic) were determined for 
each of the groups. 

Finally, —item-by-item ^ comparisons 
were made to determine the reliability 
of the differences between the per cent of 
responses changed by the normal group 
and each of the clinical groups respec 
tively. Lists of items with reliable differ- 
ences in frequency of change were com- 
piled and studied to determine if they 
showed any dominant characteristics. 

In view of the small N for each of the 
patient groups, the present study can 
only be considered as exploratory rather 
than definitive, a search for support Or 
refutation of specific hypotheses, rather 
than a basis for final establishment. of 
conclusions. 


í 


CHAPTER III 


RESULTS 


A. MEAN PROFILES FOR THE NORMAL AND 
CLINICAL GROUPS 


HE MEANS and standard deviations 
T each of the groups on each of 
the validity and clinical scales of the 
MMPI, for both administrations of the 
inventory, are given in Table 4. The cor- 
responding profiles are presented in 
Figures 1-5. 

The significance of the difference be- 
tween the first and second administra- 
tion means for a given scale in a given 
group was tested by the following for- 
mula: 


M-2M 


v ух: (ZD): 
NXN-31) 


ln - Ma 


where D — the difference between the 
first and second scores of a given indi- 
vidual. Only those differences have been 
considered reliable which yield a “t” 
score with a probability of less than 1 per 
cent. 

It will be noted with respect to the 
clinical variables that there is a general 
tendency for the mean scores on the 
second administration to be lower than 
those [or the first. Exceptions to this 
occur on the Pd scale in the out-patient 
(OPD) group, on the Mf scale in the hos- 
pitalized neurotic (PN) group, and on 
the D and Hy scales in the psychotic pa- 
tients who did not receive electroshock 
therapy (PS-NS). None of the mean 
score differences of the combined Nor- 
mal Group is statistically significant nor 
is any of the OPD group. The mean F- 
score difference in the Normal Group has 


a P of < .02. 


In the PN group, the second test 
means on the Hs, Hy, and Pd scales are 
significantly lower than the original 
means (P < .o1 in each case). 

The PS-NS group shows no significant 
mean differences on any of the clinical 
scales, The difference between the two 
K-score means is significant at the 1 per 
cent level. 

The sample of psychotic patients who 
received electroshock therapy  (PS-S) 
shows a reliable decrease of mean score 
on the Hs, D, Hy, Pd, Pt, and Sc scales. 
The mean difference on the Pa scales has 
a probability of less than 2 per cent. 

With respect to the validity scales, it 
should be noted that there is a consistent 
tendency for the F-score mean to be 
smaller on the second test, while the re- 
verse holds for the K-score except for 
the PS-S group. None of the mean differ- 
ences for these two variables is reliable, 
however. 

From Table 2, it may be seen that the 
Normal Group is characterized by a 
higher proportion of single persons and 
persons with a high school education or 
better than characterizes the clinical 
groups. From Fig. 1 and Table 4, it 
appears that these selective factors have 
not unduly influenced the "normality" 
of this group insofar as the clinical scales 
of the MMPI are concerned. The two 
profiles follow the 50-score line closely 
and there is not the elevation on the Hy- 
scale to be expected in a college group. 
Furthermore, the test-retest reliabilities 
for the various scales in the combined 
normal group closely approximate those 
reported for normals in the literature. 

In working with personality scores in 
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TABLE 4 


® 


Test-Retest MEANS AND STANDARD DEVIATIONS FOR THE SCORES or EACH GROUP ON THE 


MMPI, WITH SIGNIFICANCE TESTS FOR THE MEAN Dirr 


? L F C 
Group Тез E Hs E 
M c M e M c M E M E M E 
Normals r 17.0 15.2 3.2 2 2 2.6 s6.6 8 8 
or „2 22 2. 6 8.3 6. 49.0 9 
N=42 2 1958 10.9 ow deo 26 26 37.8 8.3 p эю 47:0 77 
ОРО I 14.8 19.0 4.4 2.2 7.9 4.0 $1.3 9.0 7 x 
3 r x Я r» “ 73.5 13-4 
N=25 2 Фо 8o 42 тб бз 3:4 52.4 8.2 fort 12.0 72.7 90 
PN I 14.2 18.6 4.6 0.1 3.9 52.2 9.8 7. 8 8&3 
. è 2.2 k 4.9 13.4 71. 
N-a 2 1116 12.4 4:2 5.2 5.0 53:4 8.5 00:5 12.4 67.6 0.7 
PS-NS I 27.4 23.4 4-3 х2 10.9 9.4 44.6 6.0 58.6 1 6 9 
3 7 3 е 3 64.9 11. 
N=13 2 1.4 9.3 45 1.3 6.6 6.7 52.5 0.8 33 4 603 7.3 
PS.S I 14.4 13.6 йй Ж 8.2 б.о $2.1 7.2 62.8 10.7 74.4 10.8 
N =20 2 23.0 23.4 3.7 2.0 4.0 2.5 5о.7 6.0 53.0 12.8 58.2 0.1 
ж m D. "n "I up 
ormals 1.4 x 2.1 1.0 1.6 2.1 
OPD 2.0 .8 1.6 “7 2.0 3 
PN Jo 1.0 45 .6 3.2% 1.4 
PS-NS 2.4 1.6 2:3 3.0* .2 .5 
PS-S 1.7 1.8 3.2% 1.0 4.0* 4. 2* 


The data for the '*?," L, and F scales are in raw score units. Those for the remaining scales are in T-score uni 
The N's on which the means and standard deviations for the Mf score are based are: OPD 720; PN = 13; PS. 


* Significant at the 1% level. 


which deviation is permitted in one di- 
rection only, in which experience with 
like measures indicates a general phe- 
nomenon of regression in the direction of 
“normality” (ie. lower scores) with a 
simple passage of time, and in which the 
expected direction of change for patients 
receiving specialized therapy is toward 
lower scores on a retest, we would ex- 
pect the retest scores of patients to show 
a reduced range and to find this reduced 
range reflected in lower test-retest corre- 
lations than are found with normals 
tested over a similar interval. In general 
these expectations are borne out by the 
results in Table 5, which show a tend- 
ency for the test-retest correlations to be 
lower for the patient groups than for 


the normals. In view of this argument, 
it may be considered more proper to con- 
sider test-retest correlations in patients 
who have had therapy as indices, at least 
in part, of therapeutic efficacy rather 
than as reliability coefficients. The small 
size of the samples in this study precludes 
testing the significance of the differences 
between these coefficients but their 
values are reflected in the tests of the 
reliability of the differences in corre- 
sponding means since they enter into the 
determination of the standard error of 
the difference. between means for the 
test and retest scores. In this respect, it 
should be noted that Darley (4) interpre- 
ted his test-retest coefficients, which were 
considerably lower than the corrected 


TABLE $ 
Теѕт-КЕТЕЅТ CORRELATIONS OF EACH GROUP ON EACH SCALE 


Group [е L |F | x= | us| D | Hy | ра | Mf | pa | Pe | se | Ma 
Normals (N =42) .718 | .810 | .749 | .663 | +724 | .734 | .682 | 569 ‚585 |. .783 
OPD (N 223) lago | -556 | .797 | .s80 | .657 | .424 | .664 | :568 | .827 | 1365 ев Joo | 1740 
PN (N =24) +884 | .596 | .687 | .763 | .536 | .570 | .617 | :6os -589 | .408 | .475 .608 | .810 
PS-NS* (N —13) 334 | -852 | .375 | -105 | .220 | .707 | .436 | 137$ -052 | .147 | —.303 | .608 
PS-S (N =20) 434 | -555 | -384 | -353 | -908 | -486 | .555 | -431 | -343 | |580 +496 .660 | .456 


* The coefficients for the PS-NS group are product-moment "'r's" corresponding to c, “rho's” i i ble 
4s. р. 362, Garrett, Statistics in Psychology and Education, 2nd edition: £a computed "гов" asigiven inal 
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TABLE 4—Continued 


Hy Pd мъ Ра Pt Sc Ma 
M о м c M с м c M с м с м с 
51.6 8.0 51.6 8.2 51.8 0.5 49.8 7.2 2.6 7.6 49.8 9.0 
40.4 8.2 50.8 9.0 50.8 8.2 47.90 7-4 то 7:2 49.1 9.8 
76.2 11.2 бт 12.2 48.8 12.1 61.2 10.6 69.5 11.4 69,8 12.8 $5.0 11.0 
73.1 12.4 66.2 14.8 48.8 0.8 59.7 9.4 68.4 13.6 67.3 13.2 55.0 10.5 
ir 10.8 63.6 13.2 2.8 8.0 62.5 10.6 66.6 11.8 69.2 12.4 55.2 12.7 
9. 1.3 60.5 If. 46.2 10.6 59.1 10.2 65.3 12.7 67.0 14.4 55.8 2.6 
62.0 12.0 58.8 14.4 70.0 12.3 66.2 11.4 70.0 16.3 50.0 18.0 
63.1 10.4 38.1 0.8 66.0 14.4 63.4 8.0 64.8 15.0 $3.3 17.8 
et E 72.0 15.8 49.4 10.0 70.2 14.2 70.9 17.2 72.2 19.7 56.2 12.2 
55.7 12 57 13.3 49.4 9.2 59.2 19.4 57.6 15.3 8 15.7 2.5 12.3 
ч" , 1 m op "pm 
2.0 ‚6 .8 2.0 1.4 46 
1.6 1,8 = 8 8 1.4 6 
4.0 PW 1.4 1.5 +2 8 4 
2 .2 8 +7. 1.2 
3.4 4:2 1.6 2.6 3.5* 3.6* 1.2 


odd-even reliabilities for his measures, as 
guages of "trait stability," representing 
a non-reducible combination of actual 
change in his subjects plus errors of 
measurement. 

The similarity of the profiles for the 
OPD and PN groups (Figures 2 and 3) 
is notable. The pre- and post-therapy 
profiles for both of these groups show 
the general form which has been de- 
scribed as "typical" for neurotics (6). In 
terms of such typical profiles, the eleva- 
tion on the neurotic triad for these 
groups would suggest that they were 
composed of so-called "mild" psycho- 
neurotics. The height of the secondary 
elevation on the Pa, Pt and Sc scales, 
however, is greater than that to be ex- 
pected in mild psychoneurotics and sug- 
gests that both the out-patient and hos- 
pitalized groups were composed of rather 
severe problems. This might explain at 
the ineffectiveness of the 
ducing the neu- 
ively small 


least partially 
out-patient therapy in re 
rotic triad although the relat 
number of therapy sessions intervening 
between the two profiles (see Table 2) is 


undoubtedly a factor also. 


“Therapy” as provided by the OPD 
clinic requires definition. The patients 
hate their most frequent and extensive 
contacts (for history-taking, therapeutic 
interviews, etc.) with “junior clerks.” 
These are junior medical students who 
are assigned to the psychiatric service 
for a ten-week period. In their OPD ac- 
tivity, they work in close alliance with 
the psychiatric residents and are super- 
vised by the senior staff psychiatrists and 
clinical associates. While the patients are 
seen regularly (weekly or bi-weekly) by 
the junior clerks, they are seen only oc- 
casionally by a staff psychiatrist. 

The rise in the mean Pd score of the 
OPD group, contrasted with the signifi- 
cant decrease in the same score for the 
hospitalized neurotics, presents an іп- 
triguing problem of explanation. One 
hypothesis which might explain these re- 
lationships is based on the relative “so- 
cial acceptance” of the neurotic symp- 
tomatology experienced by the out-pa- 
tient versus the in-patient. In the case 
of the out-patient, the probing of the 
therapist to uncover possible conflictual 
material which might have an etiologic 
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FiGURE 1. Mean test and retest profiles of the combined normals. (N — 42.) 
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Ficure 2. Mean pre-therapy and post-therapy profiles of the out-patients. (N = 
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Ficure g. Mean pre-therapy and post-therapy profiles of the hospitalized neurotics. (N = 24.) 
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role and his apparent discounting of the 
gross symptomatology may result in a 
sensitizing of the patient to the quality 
of his social adjustments to the extent 
that, with later administrations of the 
MMPI, he becomes a “plus-getter” on 


f L К P H, D 
120 


110 


the therapist and patient alike may legit- 
imately noté the progress of therapy in 
terms of the amelioration of gross symp- 
tomatology and there is less of the subtle 
pressure on the patient to continue to 
uncover interpersonal conflicts. 
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FIGURE 5. Mean pre-therapy and post-therapy profiles of the psychotics 
who received electroshock therapy. (N = 20.) 


those items of the Pd-scale which per- 
tain to family and social adjustments, 
while his neurotic symptoms are rela- 
tively unaffected.t The lower mean scores 
on the L and Hy scales for the second 
administration of the MMPI to the OPD 
group, while not significantly different 
from the original means, are in accord 
with this hypothesis. 

In the case of the hospitalized neurot- 
ic both the severity of symptoms which 
justified intramural therapy and the fact 
that hospitalization occurred, mean that 

1 The term “plus-getter” has been applied to 
those individuals with а specific motivation, 


conscious or otherwise, to put themselves in an 
unfavorable or socially undesirable light. 


This hypothesis is supported some- 
what by the data from Table 9 which 
shows the percentage of item responses 
changed for those items classified as per- 
taining to "family and marital" concerns 
(Category XI) to be less for the PN 
than for the OPD group. Conversely, the 
OPD group shows a tendency to smaller 
percentages of item-responses changed 
than the PN group for those items classi- 
fied as pertaining to general neurologic, 
cranial nerves, motility and coordination, 
and sensibility (Categories II, III, IV 
and V). 


It is interesting to note that with psy- 


chotic patients, the ordinary regimen of 


4 
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the psychiatric ward in the absence of 
clectroconvulsive therapy is not adequate 
to achieve a significant reduction of the 
mean scores on the psychotic scales. 
While all of the psychotic scales in Fig- 
ure 4 show reductions in mean score, 
there is a slight though not significant 
increase in the mean scores for the D 
and Hy scales. The relative “mildness” 
of the profiles plus the fact that these 
patients did not receive electroshock 
suggests that they were borderline or 
mild psychotics with extensive neurotic 
coloring and this fact may account for the 
apparent shift from psychotic to neurotic 
symptomatology. Concomitant with this 
shift, there is an increase in defensiveness 
or control as revealed in a mean K-score 
increase of 7.90 (t = 3.000, P < .01). 
Figure 5, which presents the two mean 
profiles for the group of psychotic pa- 
tients who received  electroconvulsive 
therapy is of interest not only because 
of the marked changes it reveals but be- 
cause of the stability shown by the Mf 
and Ma scales. The stability of these two 
scales in a group which shows so much 
change on the other clinical variables is 
of great importance as evidence for the 
selective sensitivity of the MMPI in re- 
flecting therapy-induced changes. Had 
all of the MMPI scales shown signifi- 
cantly reduced means following electro- 
shock therapy, it might be argued that 
all that was being measured was a gross 
tendency to response-change as a result 
of the confusion and memory defects as- 
sociated with EST and that specific per- 
sonality changes were masked. The dif- 
ferential change reflected in the profiles 
of Figure 5 argues for the efficiency of 
the MMPI as an instrument for detect- 
ing the particular results of special thera- 


pies. 
In summary, the data on the mean 


pre- and. post-therapy profiles reveals no 


significant changes for the OPD group, 
reliable improvement in the Hs, Hy, 
and Pd scores for the sample of hospital- 
ized neurotics (PN), no significant 
changes for the psychotics who received 
no electroshock therapy (PS-NS), and re- 
liable improvements on the Hs, D, Hy, 
Pd, Pt, and Sc scores for the psychotics 
treated with electroshock (PS-S). 


B. Group DIFFERENCES IN THE FRE- 
QUENCY AND NATURE OF RESPONSE 
CHANGES 


Table 6 gives the mean per cent of 
responses changed per item for each of 


TABLE 6 


MEANS AND STANDARD DEVIATIONS FOR 
PER CENT or RESPONSES CHANGED 


Grop N M c 

Normals 42 14.0 10.0 
OPD 25 22.0 10.8 
PN 24 18.4 13.6 
PS-NS 13 30.6 13.2 
PS-S 20 30.6 13.4 


F-values 


Group OPD PN PS-NS PS-S 

Normals 1.1 & 8% 1.7* k. 8" 

OPD 1.6* 1.5" 1.0" 

PN І.о 1.0 

PS-NS 1.0 
t-values 


Group OPD PN PS-NS PS-S 


Normals 12.0* 

OPD 

PN 14.4* — 14.0* 
PS-NS 1 


* Significant at 190 level. 
the groups. These values are the same as 
the per cent of all item-responses per 
group which are changed. The per- 
* Let i = no. of items, p = no. of persons in 


group, and c— no. of responses changed per item. 
Then the mean values of Table 6 are derived from 


= ze 
2D which is also equal to —. 
i pi 
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centage of responses changed by the 
normals agrees well with the lower limits 
of comparable values reported in the lit- 
erature. The percentage of responses 
changed in the two subgroups of normals 
were very similar (14.41 per cent and 
15.56 per cent). It should be noted that 
the OPD and PN groups not only show 
an increase in the per cent of responses 
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and PS-S groups are significantly greater 
than that for the PN group. The differ- 
ence between the means of the two psy- 
chotic groups is not reliable, An hypothe- 
sis suggested by the data of Table 6 is 
that neurotic and psychotic patients have 
specific frequencies of response change 
to the MMPI items which are signifi- 
cantly different from one another, signifi- 


TABLE 7 


PERCENTAGE DISTRIBUTIONS OF ITEM RESPONSE CHANGES IN EACH GROUP ACCORDING 
TO DIRECTION OF CHANGE 


Group TtoF FtoT T to? ? toT F to? ?toF 
Normal 373 33.8 6.2 Lys 9.8 qt 
OPD 42.2 43-7 2.4 4.2 2.2 5.3 
PN 46.5 40.4 157 3.1 3.2 fr 
PS-NS 45.9 36.2 1.8 5.5 Y.5 9.1 
PS-S 40.6 40.2 Sä 3:2 б.т 4.5 
ТЕ Ш TorF ? 
Group and to to 
F toT ? | Tor F 
Normal FER 16.0 12.9 
85.9 4.6 9.5 
PN 86.9 4-9 8.2 
PS-NS 82.1 3.3 14.6 
PS-S 80.8 її +5 7.7 


changed over that for the normals but 
that this per cent is practically the same 
for both of the therapy groups, the dif- 
ference between the two values being 
only 3.5 per cent. Likewise, the per cent 
of responses changed is nearly identical 
for the two psychotic groups and is twice 
that for normals, both the shock and 
non-shock treated psychotics changing 
approximately one-third of their re- 
sponses. The mean per cent of responses 
changed per item for the OPD group is 
significantly greater than that for the 
Normal Group. The significantly greater 
variances of the PN, PS-NS, and PS-S 
groups as compared to that of the nor- 
mals prevent testing of the significance 
of the differences among the related 
means. The mean values for the PS-NS 


cantly greater than that for normals, 
and relatively independent of the specific 
type of therapy they аге receiving. 
Table 7 presents data on the frequency 
of response change in each of the groups 
studied in terms of the direction of the 
changed response. Two facts are immedi- 
ately apparent from this table. First of 
all, there appear to be no gross differ- 
ences among the groups in the direc 
tional distribution of the response 
changes. For cach group, the percentage 
of responses changed from "True" to 
"False" is within ten points of the per- 
centage of "False" to “True” responses: 


Secondly, the percentage of response 


changes which involve directly opposite 
responses, i.e., from “True to False" or 
False to True,” constitute, in all groups: 


МӘЖ 
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70 per cent or better of all changes. It 
may be noted that changes involving a 
shift from an originally uncertain atti- 
tude or introspection (as evidenced by a 
"Cannot Say" response) to a definitely 
positive or negative ("True" or "False") 
response constitute from 7-7 per cent to 
12.9 per cent of the response changes in 
the five groups. Since the number of 
"True" and “False” responses on the first 
administrations of the MMPI constitute, 
on the average, 97 per cent of all re- 
sponses, with the remaining 3 per cent 
of responses in the "Cannot Say" cate- 
gory, it is clear from Table 7 that orig- 
inal "True" and "False" responses con- 
tribute a smaller proportion of the total 
of changed responses than would be ex- 
pected from the proportion they consti- 
tute of all original responses. On the other 
hand, the per cents of all response 
changes which are in the "Cannot Say" 
to “True” or "False" direction are from 
two to three times the proportions which 
the “Cannot Say" responses are of all 
original responses. From this data, it 
appears that the "Cannot Say" responses, 
which are not scored on any of the 
MMPI scales, are considerably less stable 
than the "scorable" “True” and “False” 
responses. This finding is in agreement 
with that of Lentz (13) who found the 
proportion of “?” answers changed on 
the Bernreuter inventory to be approxi- 
mately twice the proportions of "Yes" 
and "No" answers which were changed. 
These findings indicate that any investi- 
gation of the differentiating potentiality 
of "Cannot Say" responses relative to 
certain classes of psychiatric patients 
would most likely show such responses 
to reflect variable rather than stable as- 
pects of the personality. The average pro- 
portion of changed responses which are 
in the "Cannot Say" to "True" or 
"False" direction is 10.0 per cent for the 


Íour clinical groups: this value may be 
compared with one of 12.9 per cent for 
the normals. There is no indication that 
therapy-induced changes in personality 
are reflected in an increased tendency 
for originally uncertain attitudes to be- 
come either positive or negative. 

The means and standard deviations 
for the number of  item-responses 
changed per individual for each of the 


TABLE 8 


MEANS AND SrANDARD DEVIATIONS FOR NUMBER 
or ITEM-RESPONSE CHANGES PER INDIVIDUAL 


Group N M c 
Normals 42 74.4 23.6 
ОРр 25 108.6 33.0 
РМ 24 106.5 29.4 
PS-NS 13 151.6 61.6 
PS-S 20 151.6 37.3 
F-values 
Group | OPD PN PS-NS PS-S 
Normals 2.0 1.6 7.2% 2.6% 
OPD 1.2 326% 1:42 
PN 4.6* 1.6 
PS-NS 2.8 
t-values 
Group OPD PN PS-NS PS-S 
Normals 19.2* 18.6* 
OPD .8 1375 
PN 10.2* 


* Significant at 1% level. 


groups is given in Table 8. 'The means 
for the two subgroups of normals were 
71.37 and 77.04 respectively, with stand- 
ard deviations of 19.36 and 26.24. The 
differences between the means and vari- 
ances of these two subgroups is not sig- 
nificant (t = 2.424; Е = 1.818). Again, 
as was noted for the over-all per cent of 
responses changed, there is a remarkable 
similarity of the OPD and PN groups 
and of the PS-NS and PS-S groups with 
respect to the mean number of responses 
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changed per individual. The mean for 
the Normal Group is significantly less 
than that of the OPD group and also 
significantly smaller than that of the PN 
group. The variances of the PS-NS and 
PS-S groups differ significantly from that 
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the general categories of items, as orig- 
inally classified by the MMPI authors, 
was determined for each of the groups. 
These data are given in Table 9, to- 
gether with the rank of each category 
for frequency of response change within 


TABLE 9 
PERCENT or [TEM-RESPONSES CHANGED PER GENERAL CATEGORY OF [TEM FOR EACH Group* 


Category Normals OPD PN PS-NS PS-S 
I 15.3 (9) 24.0 (7.5) 21.8 (14) 31.6 (10 8.3 (2 
II 8.0 (22) 23 |, 24.3(7) р О ea 
III 16.0 (8) 16.7 (22) 18.9 (18) 34-3 (4) 25.0 (18.3) 
IV 7.5 (23) 30.7 (1) 31.9 (2) 37.2 (1) 25.8 (16.5) 
V 7.1 (24) 26.4 (3) 35.8 (1) 29.2 (18.5) 32.0 (8) 
VI 10.7 (17) 19.2 (17) psg 26.2 (20) 27.0 (14) 
VII 10.5 (19) 16.8 (20.5) 23.3 (9.5) 29.2 (18.5) 19.0 (23) 
VIII 6.5 (25) 24.0 (7.5) 27.7 (3) 23.8 (21) 25.0 (18.3) 
IX 11.9 (13.5) 29.6 (2) 19.2 (17) 32.3 (8.5) 21.0 (22) 
x 13.5(12 19.4 (16) 21.9 (13) 34-0 (5) 30.3 (12) 
XI 10.6 (18) 20.0 (15) 15.4 (22) 29.3 (17) 23.7 (21) 
XII 22.4 (1) 22.4 (14)4 27.1 (4) 36.3 (2) 39.2 (1) 
XIII 11.5 (16) 19.0 (18) 14.9 (24 23.1 (22) 25.0 (18.3) 
XIV E 23.8 (10) 26.0 (s) 29.8 (15) 31.3 (9) 
XV 18.0 (6) 16.4 (23) 17.3 (19) 31.2 (11) 25.8 (16.5) 
XVI 21.0 (2) 23.6 (11) 21.6 (15 32-007) 30.7 (11) 
XVII did 23.5 (12) 23.3 (9.5) 30.8 (12.3) 34-5 (s) 
XVIII 11.9 (13.5) 25.3 (4) 23.0 (11 30.8 (12.3) 34.7 (4) 
XIX 18.9 (4) 24.8 (6) Am 34.6 (3) 34.0 (7) 
XX 14.1 (11) 17.6 (19) 16.1 (20) 32.3 (8.5) 29.0 (13) 
XXI 9.4 (21) 16.1 (24) 15.9 (21) 29.5 (16) 26.1 (15) 
XXII 14.5 (10) 23.9 (9) 21.0 (16) 30.2 (14) 34.3 (6) 
XXIII 11.6 (15) 12.0 (25) 10.1 (25) 19.8 (23) 18.6 (24) 
XXIV 20.7 (3) 25.0 (5) 25.3 (6) 33.3 (6) 35.3 (3) 
XXV 9.5 (20) 16.8 (20.5) 15.0 (23) 16.4 (24) 23.7 (20) 


* The figures in parentheses give the ranks of the categories in the order of frequency of changed 


responses within each clinical group. 


of the normals and hence testing of the 
related mean differences is not applica- 
ble. The mean for the OPD group does 
not differ significantly from that of the 
PN group, and the means of the PS-NS 
and PS-S groups do not differ signifi- 
cantly. The PS-S group mean is signifi- 
cantly larger than that of either the OPD 
or the PN group. 

In an attempt to determine whether 
there were any group differences in the 
pattern of response-changes in terms of 
the general types of items, the percentage 
of item-responses changed in each of 


each of the groups. The general categories 
plus the number of items classified in 
each are as follows: 


I. General Health (9 items). 
II. General Neurologic (19 items). 
JII. Cranial Nerves (11 items). 
IV. Motility and Coordination (6 items). 
V. Sensibility (5 items). 
VI. Vasomotor, Trophic, Speech, Secretory 
(10 items). 
VII. Cardiorespiratory (5 items). 
УШ. Gastrointestinal (11 items). 
IX. Genitourinary (6 items). 
X. Habits (20 items). 
XI. Family and Marital (29 items). 
XII. Occupational (18 items). 
XIII. Educational (12 items). 
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XIV. Sexual Attitudes (19 items). 
XV. Religious Attitudes (20 items). 

XVI. Political Attitudes-Law and Order (46 
items). 

XVII. Social Attitudes (72 items). 

XVIII. Affect, Depressive (32 items). 

XIX. Affect, Manic (24 items). 

XX. Obsessive, Compulsive (15 items). 

XXI. Delusions, Hallucinations, Illusions, Ideas 
of Reference (31 items). 

XXII. Phobias (29 items) 

XXIII. Sadistic, Masochistic (7 items). 

XXIV. Morale (33 items). 

XXV. Items modeled after suggestions of Hart- 
shorne, May and Shuttleworth to indi- 
cate whether the individual is trying to 
place himself in an improbably accept- 
able or unacceptable light (15 items). 


Application of the 7?-technique revealed 
no significant diflerence in the distribu- 
tions of frequencies of response changes 
per general category for the two sub- 
groups of normals (2 = 27.94; P > :30). 

While the most "unstable" categories 
in the normal subjects are XII (Occupa- 
tional) and XXIV (Morale), those with 
greatest [requency of response change in 
the OPD and PN groups are IV (Motility 
and Coordination) and V (Sensibility). 
Greatest frequency of change for the PS- 
NS and PS groups is in XII (Occupa- 
tional). There is no clear differential pat- 
tern of frequency of response change per 
category of item. In general, | the first 
nine categories, those dealing with physi- 
cal symptomatology of the various organ 
Systems, Category XVIII (Affect, Depres- 
sive) XXI (Delusions, Hallucinations, Il- 
lusions, etcs.) XXII (Phobias), and XXV 
(the “L” scale items) show a considerably 
Sreater frequency of response change in 
the clinical groups than characterizes the 
normals. 


C. Irem CHANGE IN RELATION TO 


ScoRE CHANGE 

In order to make an appraisal of i 

relationship between the amount е 
item-response change and the degree 


profile change, the profiles of the three 
individuals having the largest, number 
of item-response changes in each group 
were studied together with the profiles 
of the three individuals with the smallest 
numbers of response changes. It was at 
once apparent that no close relationship 
existed between the number of item re- 
sponses changed and the degree of pro- 
file change. 

This brief examination suggested that 
there were individual differences and 
probably group differences in the rela- 
tive specificity of item changes with re- 
spect to profile changes which might be 
interpreted as reflecting therapeutic 
progress. It was thought advisable to de- 
vise a measure of the relationship of 
item-change to scale-score change and to 
study the distribution of this measure in 
each of the special groups. The measure 
used for this purpose was simply the 
ratio of the total number of item-re- 
sponses changed per individual to the 
total amount of change, without regard 
to sign, in the raw scores on the clinical 
variables. Table 10 reports the means 
and standard deviations of this ratio for 
the normals and for each of the clinical 
groups. 

Since only 213 of the 495 items of the 
MMPI included in this study are scored 
on any of the eight psychiatric variables 
covered by those items (Hs, D, Hy, Pd, 
Pt, Sc, and Ma), only 43 per cent of item 
changes would be expected to be re- 
flected in scale score change. Thus, for 
every 10 item-responses changed, a min- 
imum change of 4.3 score points would 
be expected on the clinical scales, if the 
changes in item-response were random or 
chance. If, on the other hand, the item- 
changes were in response to particular 
therapeutic efforts directed at the devia- 
tions measured by certain of the psychia- 
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tric scales, the ratio of item-change to 
score-change should be lessened, ie, a 
given number of item-changes should 
result in a greater amount of scale-score 
change than is true for normals. 

The findings are reported in Table 10. 

The Normal Group shows a mean 
ratio of item-change to score-change 


TABLE 10 


MEANS AND STANDARD DEVIATIONS FOR Ratio 
OF NUMBER or ITEMS CHANGED TO AMOUNT 
ОЕ SconE CHANGE 


Group N M c 
Normals 42 4.0 2.8 
OPD 25 3.3 1.0 
PN 24 3.2 1.2 
PS-NS I3 3.4 1.3 
PS-S 20 3.4 T. 
F-values 

Y 

Group OPD PN PS-NS PS-S 


Normals s.8* 3.8* 
OPD 


$ 3.0 ат 
ї.5 I -6 
PN I E 18 
PS-NS 1.4 
t-values 
Grup OPD РМ PSNS PSS 
Normals 2.6* 3.6* 
PD 1:3 4 E 
PN D -9 
PS-NS E 


* Significant at the 1% level. 


Which is just about what is to be ex. 
pected by chance. The means for the 
clinical Broups are smaller, 
increasing Specificity of j 
increasingly higher Proportion of scale- 
Score items among those Which are 
changed. The mean ratios for the PS.NS 
and PS.S Broups are significantly smaller 
than that for the Nor 
difference betwe 


x indicating an 
tem-change, an 
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One sees, then, that neurotic and psy- 
chotic patients undergoing various thera- 
pies not only tend to change a signifi- 
cantly greater number of their item-re- 
sponses than do normals (Table 8), but 
that their response changes show a signifi- 
cantly greater concentration of items 
which are scored on the psychiatric scales 
of the MMPI. We have in this data posi- 
tive evidence of the potential validity of 
the MMPI for the measurement of thera- 
peutic effects. At the same time, the lack 
of close relationship between over-all 
amount of item change and profile 
change argues for caution in the inter- 
pretation of profile changes. For ex- 
ample, if an alcoholic takes the MMPI 
when not fully recovered from the gen- 
€ral confusional and somatic effects of a 
severe drinking bout and repeats the in- 
ventory later with a more clear sensor- 
ium, one would expect a very large num- 
ber of response changes and probably 
considerable profile change. No matter 
how extensive the profile changes might 
be under these conditions, they should 
not be interpreted as evidence of a 
change in personality but rather as evi- 


dence of recovery from an acute disturb- 


ance of certain perceptual-cognitive and 
somatic functions, Thus, if the F-score of 
the original profile were very high, it 
need not necessarily invalidate the pro- 
file in the usual sense, but rather might 
better be interpreted as indicating that 
the clinical profile reflects an acute dis- 
turbance and does not typify the individ- 
ual’s basic personality. If, on the other 
hand, this same person were tested and 


retested when fully sober, over an in- 
terval in which 


had been attem 


iderably fewer items than 
In the above example. 


bo Z = rr at 
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True personality changes ag reflected in 
profile changes should mean a greater- 
than-chance frequency of change of 
those items which yield scale scores. 


D. CHARACTERISTICS OF THE Most STABLE 
ITEMS AND THE Most VARIABLE ITEMS 
IN EACH GROUP 

Since the variability of the items (per- 
centage of responses per item changed) 
shows a large range in each of the groups 
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found on the F and Sc scales. A similar 
relationship, concentration of the most 
stable items on the F and Sc scales, holds 
in each of the four clinical groups. 

No equally consistent pattern is found 
among the groups for the most variable 
items. In the normals and PS-NS groups, 
the Hy scale has the highest concentra- 
tion of variable items. In the PN group, 
the Hy and Sc scales have the highest— 
and equal—concentrations of variable 


TABLE її 


PERCENTAGE DISTRIBUTIONS, 


BY SCALE, OF THE SCORES YIELDED BY THE Most STABLE AND 


ny THE Most VARIABLE ITEMS FOR EACH GROUP 


Percent of Total Scale-Score Per Scale 


No. ГА Tot. 

Gr ltem % 

oup | Type | og Change Score) L | F | K | Ps | p | Hy | ра [Mi] Pa | Pt [зе | Ma 

N Stable 27 o 38 
ormal | Variable| 28 | as-o*| 25 
OP. Stable 3o 39 
D Variable| 37 31 
PN Stable 49 55 
Variable | 33 45 
PS. Stable 28 0-7.7 31 
S-NS | Variabie| 37 | s3.8+| 35 
PS- Stable 20 | O75 зо 
S | Variable} 32 |$5-0+| 37 


(ев. 0-42.99 per cent in the normals), 
it was considered desirable to determine 
what differences might characterize the 
most stable and the most variable items 
for each distribution. Table 11 presents 
the results of an analysis of the scale 
representations of the two types of item 
in each group. Thus, it was found that 
the 27 items which had a o per cent 
change in the Normal Group (i.e. each 
individual's second response to an item 
was the same as his first response) yielded 
a total of 38 raw scale-score units. Of the 
total amount of scale-score yielded by 
these items, the largest percentages are 


scale-score units should 
exceed the number of items is accounted for by 
the fact that certain of the items are scored on 
more than one scale, Lists of the most stable 
items, i.e., those with “0%” change, are providec 
in the Appendix. 


That the number of 


items. In the PS-S group, the D scale 
shows the highest concentration of varia- 
ble items, with the Hy and Pd scales 
showing the second highest frequency. 
In short, the F and Sc scales are marked 
in all groups by a relatively high fre- 
quency of very stable items; the variable 
items, on the other hand, are irregular 
in their scale concentration from group 
to group. In view of the stability charac- 
terizing many of the items composing 
the F and Sc scales, one would have 
reason to expect any change in the scores 
on those scales to more probably reflect 
valid personality changes rather than 
to be chance variations. 

Table 12 reports for each group the 
proportions of the most stable and most 
variable items for that group which are 
"obvious," "subtle," and not scored on 
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tric scales, the ratio of item-change to 
score-change should be lessened, i.e., a 
given number of item-changes should 
result in a greater amount of scale-score 
change than is true for normals. 

The findings are reported in Table 10. 

The Normal Group shows a mean 
ratio of item-change to score-change 


TABLE 1o 


MEANS AND STANDARD DEVIATIONS FOR RATIO 
OF NUMBER ОЕ ITEMS CHANGED TO AMOUNT 
or SCORE CHANGE 


Group N M c 
Normals 42 4.0 2.5 
OPD 25 $43 1.0 
PN 24 3.2 14 
PS-NS 13 3.4 1.4 
PS-S 20 3.4 t$ 

F-values 

\. 

Group OPD PN PS-NS PS-S 
Normals $.8* 5:89 3.0 2.1 
OPD 1.5 1.9 2.6 
PN 1.2 1.8 
PS-NS 1.4 

t-values 
Group OPD PN PS-NS PS-S 
Normals 240" 2.6% 
OPD 1.3 ү" 4 
PN т.т .9 
PS-NS o 


* Significant at the 1% level. 


which is just about what is to be ex- 
pected by chance. The means for the 
clinical groups are smaller, indicating an 
increasing specificity of item-change, an 
increasingly higher proportion of scale- 
score items among those which are 
changed. 'The mean ratios for the PS-NS 
and PS-S groups are significantly smaller 
than that for the Normal Group. The 
difference between the mean ratios of the 
OPD and PN groups is not significant 
nor is that between the means of the 
PS-NS and PS-S groups. 
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One sees, then, that neurotic апа psy- 
chotic patients undergoing various thera- 
pies not only tend to change a signifi- 
cantly greater number of their item-re- 
sponses than do normals (Table 8), but 
that their response changes show a signifi- 
cantly greater concentration of items 
which are scored on the psychiatric scales 
of the MMPI. We have in this data posi- 
tive evidence of the potential validity of 
the MMPI for the measurement of thera- 
peutic effects. At the same time, the lack 
of close relationship between over-all 
amount of item change and profile 
change argues for caution in the inter- 
pretation of profile changes. For ex 
ample, if an alcoholic takes the MMPI 
when not fully recovered from the gen- 
eral confusional and somatic effects of a 
severe drinking bout and repeats the in- 
ventory later with a more clear sensor- 
ium, one would expect a very large num- 
ber of response changes and probably 
considerable profile change. No matter 
how extensive the profile changes might 
be under these conditions, they should 
not be interpreted as evidence of а 
change in personality but rather as evi- 
dence of recovery from an acute disturb- 
ance of certain perceptual-cognitive and 
somatic functions. Thus, if the F-score of 
the original profile were very high, it 
need not necessarily invalidate the pro- 
file in the usual sense, but rather might 
better be interpreted as indicating that 
the clinical profile reflects an acute dis- 
turbance and does not typify the individ- 
ual’s basic personality. If, on the other 
hand, this same person were tested and 
retested when fully sober, over an in- 
terval in which intensive psychotherapy 
had been attempted, an equally exten- 
sive change in his MMPI profile might 
be found to result from a change in re 
sponse to considerably fewer items than 
were changed in the above example: 


— — 
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True personality changes as reflected in 
profile changes should mean a greater- 
than-chance frequency of change of 
those items which yield scale scores. 


D. CHARACTERISTICS OF THE Most STABLE 
ITEMS AND THE Mosr VARIABLE ITEMS 
IN EACH GROUP 


Since the variability of the items (per- 
centage of responses per item changed) 
shows a large range in each of the groups 
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found on the F and Sc scales. A similar 
relationship, concentration of the most 
stable items on the F and Sc scales, holds 
in each of the four clinical groups. 

No equally consistent pattern is found 
among the groups for the most variable 
items. In the normals and PS-NS groups, 
the Hy scale has the highest concentra- 
tion of variable items. In the PN group, 
the Hy and Sc scales have the highest— 
and equal—concentrations of variable 


TABLE її 


PERCENTAGE 
Bv THE MosT VARIABLE 


DisTRIBUTIONS, BY SCALE, OF THE ScoRES YIELDED BY THE Most STABLE AND - 


ITEMS FOR EACH GROUP 


Percent of Total Scale-Score Per Scale 


No. o Tot. 
Group | Тел nuls | Chanse] 8916 L | F | x= [Hs | p [ну | pa | wr | ра | Pe | sc | Ma 
"—— | 2g | afer] $8 | | SE о EH S as | +s SHE D EC 
om | ване | зе mn) m5)" es] ^" Ера ЕЕЕ 
| 
ram 
еттита и $8 [7 [22° fons аЗ ЕНЕ TEES 
PSS Stable | 22 а E 6.7 | sees mmm 16.3 E КОЕ 8.1 


(eg, 0-42.99 per cent in the normals), 
it was considered desirable to determine 
what differences might characterize the 
most stable and the most variable items 
for each distribution. Table 11 presents 
the results of an analysis of the scale 
representations of the two types of pem 
in each group. Thus, it was found that 
the эу items which had a о per cent 
change in the Normal Group (i.e. each 
individual's second response to an item 
was the same as his first response) yielded 
a total of 88 raw scale-score units.* Of the 
total amount of scale-score yielded by 
these items, the largest percentages are 


a That the number of scale-score units shout 
exceed the number of items is accounted oF by 
the fact that certain of the items are EE 
more than one scale. Lists of the mosi sanie 
items, i.e. those with “0%” change, are provide 


in the Appendix. 


items. In the PS-S group, the D scale 
shows the highest concentration of varia- 
ble items, with the Hy and Pd scales 
showing the second highest frequency. 
In short, the F and Sc scales are marked 
in all groups by a relatively high fre- 
quency of very stable items; the variable 
items, on the other hand, are irregular 
in their scale concentration from group 
to group. In view of the stability charac- 
terizing many of the items composing 
the F and Sc scales, one would have 
reason to expect any change in the scores 
on those scales to more probably reflect 
valid personality changes rather than 
to be chance variations. 

Table 12 reports for each group the 
proportions of the most stable and most 
variable items for that group which are 
“obvious,” "subtle," and not scored on 
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any scale. The "obvious" and "subtle" 
items are composed of all items scored 
on the D, Hy, Pd, Pa, and Ma scales of 
the MMPI and have been keyed by 
Wiener and Harmon (18) according to 
the relative ease or difficulty with which 
the nature of the socially desirable rc- 
sponse may be detected. It will be noted 
that there is a consistent tendency for the 
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non-scored items in the stable and varia- 
ble items is significant for the Normal 
and OPD groups. In short, the variable 
items, as distinguished from the stable 
items, are characterized by a tendency to 
be either of a type in which the direc- 
tion of the socially desirable response is 
not readily perceptible or of a type 
which has not yielded normal-criterion 


TABLE 12 


THE PROPORTIONS OF THE Mosr STABLE AND Most VARIABLE ITEMS FROM EACH GROUP 
WHICH ARE “OBVIOUS,” “SUBTLE,” AND Nor SCORED ON Any SCALE 


The Most Stable Items 


The Most Variable Items 


Group No. of Percent | Percent | 95 Not No. of Percent | Percent | 95 Not 
Items | Obvious | Subtle | Scored Items | Obvious | Subtle | Scored 

Normal 27 25.9 о.о 14.8 28 10.7 32.1 47.9 
OPD 30 23.3 6.7 13.3 37 27.0 16.2 48.6 
PN 49 12.2 6.1 20.4 33 27.3 21.2 21.2 
PS-NS 28 21.4 3.6 28.5 37 21.6 32.4 30.0 
° о.о 20.0 3 6 40.6 


PS-S 20 то. 


32 40.6 T$. 


stable items to consist of higher propor- 
tions of obvious rather than subtle items, 
with a mean difference of 15.3 per cent 
between the proportions of these two 
types of item among the stable items of 
the five groups. No such consistent tend- 
ency appears in the obvious-subtle dis- 
tributions of the variable items. In com- 
paring the stable and variable items, no 
consistent group-to-group relationship 
appears with respect to the frequency of 
obvious items. However, in all groups, 
the variable items show higher propor- 
tions of subtle items than are found in 
the stable items, and for the Normal and 
PS-NS groups the differences between the 
proportions of subtle items in the stable 
and variable items respectively are signif- 
icant (Р < .01), Furthermore, there is a 
consistent tendency for the variable 
items to have a higher proportion of 
items which are not scored on any scale 
than is true for the stable items, and the 
differences between the proportions of 


critical ratios sufficiently large to result 
in their selection for scoring on any of 
the existing MMPI scales. That items 
having a high frequency of changed re- 
sponses upon repetition with a given 
group should show these characteristics 
is to be expected both from the psychol- 
ogy of that behavior which is variable 
and from the nature of item-selection. 
Thus, there are areas of behavior the 
“subtlety” of which is expressed by the 
absence of any clear-cut, readily recog- 
nized pattern of response in the group 
which may serve as a constant standard 
against which the individual can evalu- 
ate or regulate his own conduct. In the 
absence of such social norms, the indi- 
vidual is required to "project" norms 
which vary as a function of recent experi- 
ence and memory of last response in а 
situation. The element of 
uniqueness in all experience plus the 
fallibility of memory makes directly for 
variability of response in the individual. 


similar 


ў 
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This variability expresses itself in the 
group by an absence of a given response 
in sufficient frequency to characterize 
that group in statistical comparison with 
another group. 

As an aside, it might be proposed that 
the ideal questionnaire should be com- 


ing effects to operate in such a way that 
they may be measured, rather than 
naively assuming that they have been 
circumvented by a clever ruse. 

Further study of the stable and varia- 
ble items was made in an effort to dis- 
cover any distinguishing characteristics 


TABLE :3 


THE PROPORTIONS or ITEMS INv 
SucH ESTIMATES IN THE 


OLVING FREQUENCY ESTIMATES AND ITEMS NoT INVOLVING 
STABLE AND VARIABLE ITEMS or EAcH GROUP 


z No, of Items | % With Freq. | _ % Without рашы п 
Group Type of Item No. of hems Estimate | Freq. Estimate | ıo Freq. ER 
Normal Stable 27 23.1 76.9 3.33 
Variable 28 33.3 66.7 2.00 
OPD Stable зо 16.7 82.3 5.00 
Variable 37 40.5 39.5 1.47 
PN Stable 49 14.3 85.7 6.00 
Variable 33 ¿88.2 6r.8 I.62 
PS-NS Stable 28 31.0 69.0 2.33 — 
Variable 37 37.8 62.2 1.64 
PS-S Stable 20 10.0 90.0 9.00 
Variable 32 28.1 7.9 2.55 


posed solely of items of the subtle variety 
which require the subject to “project” 
his own norms, thus gaining for the per- 
sonality inventory the highly regarded 
virtues of the projective techniques. 
There are at least two answers to this 
proposal. In the first place, the variabil- 
ity in response to such items, as has been 
indicated in this study, would suggest 
that they might seriously detract from 
the effort to establish reliable relation- 
ships between the test measures and 
given criteria. Secondly, recognizing that 
there are attitudes peculiar to the process 
of test-taking which are different from 
the particular attributes we are inter- 
ested in measuring and which have cer- 
tain distorting effects on the latter, there 
would seem to be some inherently scien- 
tific cogency to permitting such distort- 


that might not have appeared in the 
above statistical analyses. Other studies 
(10) have indicated that items involving 
an estimate of frequency by the subject 
(e.g, “much of the time," "seldom," etc.) 
have higher rates of change than items 
not requiring such estimates. An analy- 
sis was made of the occurrence of these 
two types of item among the stable and 
variable items of each of the groups in 
the present investigations, with the re- 
sults reported in Table 13. A consistent 
tendency was found for the stable items 
of each group to have a higher propor- 
tion of items not involving an estimate of 
frequency than was true for the variable 
items. For the stable items from all 
groups, the mean ratio of the number of 
items not involving an estimate of fre- 
quency to the number of items involving 
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such an estimate was 5.19; for the varia- 
ble items, the comparable mean ratio 
was 1.86. It appears from this that there 
is a distinct tendency for those items in 
which the subject must make some esti- 
mate of frequency of occurrence of a 
behavior in himself or others (e.g., “I 
feel weak all over much of the time," or 
"I believe I am no more nervous than 
most others") to be more variable in the 
responses they elicit from the same indi- 
viduals than are the items not requiring 
such estimates. 

In a study of the relationship of vari- 
ability of items and the degree to which 
they were stated in present or past tense, 
no consistent group-to-group tendency 
was found. In three of the five groups 
(Normal, PN, and PS-S) the variable 
items yielded a higher ratio of “nur-ber 
of items in the present tense" to “num- 
ber of items in the past tense" than did 
the stable items. In the other two groups 
(OPD and PS-NS), this relationship was 
reversed, the stable items yielding a 
higher ratio of presenttense to past- 
tense items. 'The mean of these ratios in 
the five groups was 4.10 for the stable 
items and 8.10 for the variable items. As 
might be expected, those items involving 
a memory of past occurrences appear 
more frequently among those items to 
which responses are quite stable, while 
those items involving present situations, 
moods, and pressures appear more fre- 
quently among those items to which re- 
sponses are changed. However, the varia- 
bility in the relative concentration of 
present-tense and past-tense items among 
the stable and variable items from group 
to group suggests that the relationship 
between the “time” factor of an item and 
its stability or variability is not a simple 
one but involves other elements. 

Finally, a study was made to deter- 
mine the relative stability of responses to 
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the so-called "double-negative" items. 
These are items which are stated in the 
negative and are scored if answered in 
the negative, that is sorted "False" (e.g 
"I do not tire quickly"). The stable and 
variable items from each of the five 
groups of subjects were examined to 
determine the number of double-nega- 
tive items they included. This survey of 
the 154 stable and 157 variable items 
yielded a total of 33 items of the double- 
negative variety. Of these 33 items, 10 
(or 30.3 per cent) were stable and the 
remainder, 69.7 per cent, were variable. 
In four of the five groups, the ratio of 
the number of double-negative items 
among the variable items to the num- 
ber among the stable items was three to 
one. In the remaining group (PS-NS), 
the ratio was one to one. 

In summary, the above analyses of the 
most stable and most variable items for 
each of the five groups of subjects have 
pointed to certain fairly consistent differ- 
entiating characteristics. ‘The stable 
items tend to be concentrated on the F 
and Sc scales of the MMPI, to be obvious 
in nature, to involve no estimate of the 
frequency of occurrence of a given be- 
havior either in the subject or in other 
people, and to be more frequently stated 
in past rather than present tense. The 
variable items, on the other hand, tend 
to show no clear concentrations on any 
of the MMPI scales, to be subtle rather 
than obvious, to involve estimates of fre- 
quency of given behavior, and to be 
stated in the present rather than the 
past tense. 


E. INVESTIGATION OF A THERAPY- 
SUSCEPTIBLE SCALE 


It was suggested earlier in this research 
that one possible direction to be taken by 
a study of item-response changes would 
be a determination of those items which 
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have a significantly greater. percentage of 
changed responses in persons responding 
favorably to therapy than in normals. 
Theoretically, such a scale of items 
would have both a prognostic and an 
evaluative function. For example, the 
greater the clinical score made by the 
individual on those items having a spe- 
сїйс tendency to change with electro- 
Shock therapy, the greater the probability 
that his post-therapy responses will be in 
the non-psychiatric direction, ie, the 
greater the probability that he will im- 
prove. It follows directly from this that 
the amount of the individual's change 
in score on the therapy-susceptible items 
may be taken as an index of the validity 
of the overall profile change as an indica- 
tion of improvement. Profile changes in 
the favorable direction not accompanied 
by adequate changes on the items specifi- 
cally responsive to the therapy concerned 
would not be misinterpreted as evidence 
of improvement. 

It might be argued that such proposed 
scales of therapy-susceptible items should 
be composed solely of those items for 
which the frequency of change of an origi- 
nally significant response in a given ther- 
ару group was reliably greater than the 
same change in normals. Such a selection 
procedure would miss those equally vari- 
able items which happened on the first 
test to have been answered in the non- 
significant direction. Consequently, the 
following procedure was used to arrive 
at a list of “therapy-susceptible” items 
for each of the treatment groups. 

‘The total number of response-changes 
was determined for each item for each of 
the “improved” clinical groups (PN, PS- 
NS, PS-S) and for the combined nor- 
mals.* These frequencies were converted 


“It has been reported previously that the Z° 
technique showed no significant differences be- 
tween the two normal groups in the manner in 


to percentages. By the use of appropriate 
nomographs (19) the reliability of the 
difference between the percentage of 
changed responses for each item in the 
Normal Group and in a given therapy 
group was determined. Only those differ- 
ences were considered reliable which 
yielded a critical ratio of 3.00 (Р < .003). 
In the cases of percentage differences 
which were borderline with respect to 
the nomographs, the following formula 
was applied: 


h—b 
Factis Nipit NbN f Y, 1 
M+ (- NEN. 0 

By this procedure, three scales of "ther- 
apy-susceptible" items were derived. In 
terms of the arguments given above, the 
items of these scales may be considered to 
constitute a potential measure of progno- 
sis and the scales will be referred to here- 
after as the Px scales. 

Since the hospitalized neurotics ap- 
peared to be the group for which inde- 
pendent validating data could be most 
readily obtained, it was decided to study 
the prognostic effectiveness of the scale 
of items derived for this group. A key 
based on these items was applied to both 
MMPI records of the 24 patients in the 
PN group. Basic statistics for the Px 
scores in this group are given in Table 


which item changes were distributed over the 
twenty-five general categories of items. | 

A study of the significance of the difference 
of frequency of response-change per item in the 
two normal groups revealed a total of only 15 
items which yielded differences significant at the 
5 per cent level. Of these 15 items, only 5 (1 per 
cent of the total number of items) yielded differ- 
ences significant at the 1 per cent level. In view 
of these findings, it was considered permissible 
to combine data from the two normal groups for 
the item study. 

5'The twenty-four items comprising this Px 
scale, аз numbered in the MMPI manual of 
directions, 1943 version, are as follows: Ag, А4, 
Aig, A16, A46, А48, Bı, Во, B11, B12, B14, B15, 
Вау, B28, C43, F2, F18, F49, F55, Gi, Geo, G21, 


H46, 127. 
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14. The mean Px score on the second 
MMPI is significantly lower than that 
obtained in the first administration. 
While this is evidence of the potential 
usefulness of the Px scores for detecting 
therapeutic changes on a group basis, the 
extreme overlap of scores before and 


TABLE 14 


MEANS, STANDARD DEVIATIONS, AND 
SIGNIFICANCE TESTS FOR THE Px SCORE 
IN THE HOSPITALIZED NEUROTICS 
(PN) (N 224) 


M c Range 
ist MMPI.... 11.9 ДЕБ 4-20 
2nd MMPI.... 8.1 4.8 I-21 
1=3.5 F=2.1 
FZ VOY P< .05>.02 


after therapy makes it practically useless 
for individual prediction. 

In order to study the distribution of 
the Px score in independent samples, the 
hospital files were searched for cases of 
diagnosed female psychoneurotics who 
were discharged “unimproved.” Only 
thirty such cases were found and of this 
number, only nineteen had MMPI rec- 
ords. Twenty-nine cases of female рѕу- 
choneurotics discharged “improved” and 
having post-admission MMPI records 
were also chosen randomly from the 
files. Table 15 gives the basic statistics of 
the Px score for these two groups. 

The difference between the mean Px 
scores of the two groups is not signifi- 


cant. 

In addition to the information on 
"condition on discharge" for the two test 
groups, the diagnostic summary of each 
patient carried a statement relative to 
prognosis. These statements were evalu- 
ated in terms of “good,” “fair,” or "poor" 
prognosis. The y? technique revealed по 
relationship between the Px score at the 
time of admission and the prognosis on 


discharge in either the "improved" or 
"unimproved" groups (y? — .989 and 
.254, respectively). 

In order to determine whether апу 
better differentiation would result from 
the use for the Px scale of only those 
items for which the frequency of change 
of an originally significant response was 
rcliably greater in the PN group than in 
the Normal Group, a key was prepared 
for the 11 items from the 24 of the first 
Px scale which satisfied this relationship- 
These items were: А-2, A-13, A-46, B-9, 
B-11, B-14, B-17, B-28, F-18, and I27 
The results of the application of this 
scale to the two test groups of hospital- 
ized neurotics revealed no improvement 
in differentiation over that of the orig- 
inal Px score (t = .446; F = 1.076). 

It would appear from the above data 
that the general approach to the selec- 
tion of therapy-susceptible items which 
has been followed in this study does not 
result in a valid set of items having any 
prognostic power to differentiate pa 


TABLE 15 
MEANS, STANDARD DEVIATIONS, AND 
5 ICANCE TEsTS FOR THE Рх SCORE 
IN SAMPLES OF IMPROVED AND 


UNIMPROVED PSYCHONEUROTICS 
M c Range 
Improved (N=29).. 10.6 4.6 3-20 
Unimproved (N 219) 8.6 gF о-18 
[1.4 FH=1.2 
P<.20>.10 P>.05 


tients who will improve under a given 
regimen from those who will not. This 
conclusion necessarily assumes that the 
criterion used to test the validity of such 
items was in itself a valid criterion. 
In the nine years that the Psycho- 
pathic Unit of the University Hospitals 
has been in operation, a total of 470 fe- 
male admissions has been diagnosed #5 
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psychoneuroses. Of this total, only 6.4 
per cent were discharged “unimproved.” 
A recovery or improvement rate of better 
than go per cent is so much greater than 
the median value reported in the litera- 
ture that one would necessarily suspect a 
biased selection of patients, a therapeutic 
program of particular composition and 
intensity, an inaccurate evaluation. of 
post-therapy status, or an admixture of 
these factors. 

Careful evaluation of the histories of 
the neurotic patients in this study and of 
the larger hospital sample of neurotics 
from which they were drawn does not 
support the hypothesis of biased selec- 
tion of case material. Review of the pro- 
cedures used in the treatment of patients 
by the Psychopathic Unit and analysis of 
the hospital regimen, aside from particu- 
lar therapeutic techniques, does not sup- 
port the hypothesis that the greater im- 
provement rate in the patients of this 
study as contrasted. with those reported 
in the literature is to be accounted for by 
differences in treatment. "There remains 
the probability that the high frequency 


of improved patients chiefly reflects 
errors of omission in the clinical evalua- 
tion of the status of patients and the 
amount of their change at the time of 
discharge. Support to this point is given 
by a readmission rate of 10.5 per cent for 
the female neurotics. 

A more objective determination of the 
posttherapy status of patients than is 
usually available is greatly to be desired. 
Such determination ultimately must re- 
side in the clinical judgment of the psy- 
chiatrist. To the extent that the psychia- 
trist can be encouraged to orient his post- 
therapy evaluation of a patient to а re- 
lationship with the factors underlying 
his assay of pre-therapy status, he will 
provide a valid clinical criterion. With 
such a criterion available, it will be possi- 
ble ¿for research techniques to establish 
independent prognostic and evaluative 
measures which will not only have value 
in the consideration of the individual 
patient but will provide even more im- 
portant objective checks on the efficacy 
of specified therapeutic efforts. 


CHAPTER IV 


SUMMARY AND CONCLUSIONS 


HIS STUDY has had three goals: to 

determine the nature and signifi- 
cance of changes in the MMPI profiles of 
groups of psychiatric patients receiving 
certain therapies, to study the extent and 
nature of item-response changes in these 
clinical groups and to contrast them with 
those found in normals, and, finally, to 
attempt the selection and testing of a 
group of therapy-susceptible items. 

A check of the approximately 7000 
records in the MMPI rescarch files at the 
University of Minnesota yielded a total 
of 124 cases for which two profiles were 
available and which met the other, cri- 
teria for this study. These cases were dis- 
tributed as follows: normals, 42; out- 
patients, 25; hospitalized neurotics, 24; 
hospitalized psychotics not treated with 
any type of shock therapy, 13; hospital- 
ized psychotics given electroshock ther- 
apy, 20. All in-patients included in the 
sample had been discharged “improved” 
or “recovered” and the two MMPI pro- 
files had been obtained within the limits 
of a single hospitalization. Since it was 
not possible to find a sufficient number 
of male cases, the samples were composed 
entirely of females. 

The sample of psychotic patients 
treated with electroshock, though a very 
mixed group, was composed largely of 
depressions апа involutional melan- 
cholias. The average number of grand 
mal treatments received by these patients 
was between six and seven, and the in- 
terval between their two MMPI's was 
slightly less than one month. This group 
showed a significant postshock reduc- 
tion of their mean scores on all of the 
MMPI clinical scales with the exception 
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of Masculinity-Femininity and Hypo- 
mania. Those means which were highest 
before therapy in this group were on the 
Depression, Psychopathic Deviate, Psy- 
chasthenia and Schizophrenia scales, and 
it was on these scales that greatest change 
occurred. 

The sample of hospitalized neurotics 
had a mean initial profile practically 
identical with that of the out-patient 
sample. However, after a mean interval 
of approximately one month, the same a5 
the median interval for the out-patient 
group, the profile for the hospitalized 
neurotics showed a significant decrease of 
the Hypochondriasis, Hysteria and Psy- 
chopathic Deviate scores. It must be re- 
membered that all patients in the three 
hospitalized groups were discharged as 
“improved” or “recovered.” It may be 
concluded that improvement in the per- 
sonality pattern of neurotics under the 
general regimen of the psychiatric ward 
is shown in a significant reduction of 
hypochondriacal and hysterical symp- 
tomatology as measured by the MMPI. 
Apparently, this form of therapy does 
not suffice to reduce any marked degree 
of depression as reflected in the Multi- 
phasic "D" score. The ineffectiveness of 
hospitalization alone to cure depressive 
states in a neurotic syndrome can only be 
judged with respect to the interval of 
intramural therapy represented in our 
sample, namely, approximately опе 
month. 

Within the limits of our small sample 
of psychiatric out-patients and the nature 
and number of psychiatric interviews in^ 
tervening between the collection of the 
two MMPI records, it would not appear 
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that out-patient therapy of the type 
afforded by the University Clinic suffices 
to affect a significant change in the mean 
scores of such patients on any of the 
clinical scales. The out-patient sample 
consisted almost wholly of psychoneurot- 
ics of mixed type who had an average of 
five clinic visits between their tests. No 
clinical evaluation of the nature of their 
response to therapy was available. The 
data of this study would indicate that 
such patients require more intensive 
therapy than that afforded by the out- 
patient clinic or a greater number of 
therapy interviews (or both) before they 
will show a favorable response. This con- 
clusion is presented with certain reserva- 
tions, It may be argued that improvement 
in the outpatient. group occurred in 
areas of attitude, self-evaluation, and 
personal relationship not sampled by the 
MMPI, It may further be argued that 
the validity of the MMPI as a measure of 
therapeutically derived changes in per- 
sonality has not been sufficiently estab- 
lished to warrant its application as the 
sole criterion in evaluating such changes. 
However, in the absence of other criteria 
and with consideration for the validity 
of the MMPI as a picture of the patient's 
pre-therapy status, the data of the pres- 
ent study do not yield evidence of im- 
provement in the psychiatric status of 
the out-patients. 

With respect to the proportion of 
item-responses changed, it was found 
that all of the clinical groups changed a 
larger percentage of their responses than 
was true for the normals, who changed 
only 14 per cent of their responses. Both 
groups of neurotics changed approxi- 
mately go per cent of their responses, 
while the two psychotic groups Бахо 
30 per cent response change. Aside from 
the fact that the clinical groups show a 


greater variability than the normals in 
their responses, it is of considerable 
interest that both groups of neurotics 
show essentially the same rate of change 
in spite of marked differences in treat- 
ment, and that both psychotic groups 
likewise show a nearly identical mean 
frequency of response change in spite of 
the marked differences in the therapy 
they received. "These findings suggest a 
characteristic rate of variability for nor- 
mals, for neurotics, and for psychotics 
which is relatively independent of en- 
vironmental stresses, although the items 
on which the variability is based are dif- 
ferent, On a stability-ofresponse con- 
tinuum, the neurotics would fall be- 
tween the normals and the psychotics, 
somewhat closer to the normals. Further 
investigation of this finding might yield 
a useful technique for solving borderline 
neurotic-psychotic differentiations. 

It has been shown that for normals 
and for groups of neurotics and psy- 
chotics, “Cannot Say" responses have а 
considerably higher rate of change than 
is found for either "True" or “False” 
responses. The proportion of all changed 
responses contributed by changes in 
original “Cannot Say” responses is from 
two to three times greater than the pro- 
portion which “Cannot Say" responses 
are of all original responses. The greater 
variability of the “Cannot Say" re- 
sponses suggests that such responses may 
occur on items which pertain to com- 
ponents of personality which are “valid- 
ly” variable rather than stable. 

By selecting cases from the extremes of 
the distributions of number of items 
changed per individual, it was demon- 
strated than no simple relationship 
holds between the total number of item- 
responses changed and the amount of 


scalescore change effected. Marked 
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changes in profile may be accomplished 
by individuals who have changed rela- 
tively few items, whereas other indi- 
viduals may change a considerably 
greater number of items and yet show 
very little profile change. 

This finding led to a study of the 
ratio of the number of items changed to 
the total amount of change in scale- 
score for the members of each of the 
groups. As a result of this investigation, 
it was discovered that the clinical groups 
not only change a significantly greater 
number of items than do normals, but 
that the items they change include a sig- 
nificantly greater concentration of scored 
items, i.e., items contributing to a score 
on at least one scale of the MMPI. This 
finding is considered evidence of the sen- 
sitivity of the MMPI to therapy-indv сей 
changes in personality. 

While the most stable items for each 
of the groups, normal and clinical alike, 
tended to be concentrated in the F and 
Schizophrenia scales, no similar pattern 
of concentration on any scale or scales 
characterized the most variable items. 

With respect to the "obvious" or 
"subtle" qualities of the items, stable 
items for each group were much more 
often obvious than subtle. No similar re- 
lationship was found for the most vari- 
able items, but these items were much 
more frequently not scored on any of the 
MMPI scales than were the stable items. 
'The tendency for the most variable 
items to be more commonly not scored 
on any of the scales probably reflects the 
fact that when item selection is on the 
basis of group differentiation at a given 
time, items which pertain to the more 
subtle and fluctuating aspects of per- 
sonality, and hence on which there would 
be no marked agreement by a given 
group, are missed. 
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The attempt to derive a scale of items 
which would both predict the response 
of a given type of patient to a given 
therapy and measure the amount of per- 
sonality change to be attributed to that 
therapy was not successful. By sclecting 
those items with a frequency of response- 
change reliably greater in an improved 
patient-therapy group than in normals 
and by keying such items.for the psy- 
chiatrically significant response, it was 
believed that the resultant scale would 
yield scores with a positive relationship 
to prognosis, high scores being associated 
with good prognosis and low scorcs with 
poor. The scale derived by this approach 
for the hospitalized neurotics was ap- 
plied to both sets of MMPI records for 
this group. In accordance with the as- 
sumptions about the therapy-measuring 
potential of such a scale, the mean score 
of the posttherapy records for the hos- 
pitalized neurotics was significantly 
lower than that of the pre-therapy rec- 
ords, although there was a marked de- 
gree of overlap in the two distributions 
of the score, over a fifth of the post- 
therapy scores being above the mean of 
the pre-therapy distribution. 

Application of the therapy-susceptible 
scale for hospitalized neurotics to inde- 
pendent samples of patients discharged 
"improved" and “unimproved” respec 
tively yielded no differentiation. It was 
pointed out in this connection that the 
extreme rarity of hospitalized neurotics 
who were discharged “unimproved” 
would give reason to question the valid- 
ity of the "improved-unimproved" cri- 
terion. Within both the "improved" and 
“unimproved” groups no relationship 
was found between the score on the 
therapy-susceptible scale at time of ad- 
mission and the prognosis at time of 
discharge. 
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Several lines of research are suggested 
by the results of this study. It is obvious, 
of course, that the analyses reported here 
should be repeated for data on males. 
With respect to the detection of non- 
specific profile changes and their dis- 
crimination from — therapy-reflecting 
changes, further study of the character- 
istics of profiles for people with high and 
low ratios of number of items changed 
to amount of score change should be 
made, The frequency of item-response 
changes in other samples of neurotics 
and psychotics should be determined as 


a check on the hypothesis that there is a 
characteristic rate of variability in these 
groups which might be used as a differen- 
tial sign. 

The problem of developing a prog- 
nostic scale for the MMPI would appear 
to be seriously hampered by the lack of 
a reliable clinical criterion of patients’ 
responses to therapy. Until a technique 
may be developed whereby the “condi- 
tion on discharge” will carry more exact 
meaning than it appears to currently, 
scales to predict response to therapy will 
be very difficult to validate. 


APPENDIX 


THE Mosr SrABLE ITEMS IN ЕАСН GRour! 


А. THE TWENTY-SEVEN ITEMS WITH 
“09%” CHANGE IN THE COMBINED NOR- 
MALS (N — 42): 

A11, A19, Ayo, A46, B5, B32, B46, C13, Сы. 
C36, C49, D29, E37, Gii, G35, Hy, Ніц, 
His, H20, H21, H22, H25, H30, H48, 17, J50, 

B. m TWELVE ITEMS WITH 
CHANGE IN THE OUT-PATIENT GROUP 
(N = 25): 

A51, D43, E21, E22, G44, G53, Hı4, His, 
Heo, H27, J45, J48. 
"The items are numbered as in the MMPI 
manual of directions, 1943 version. 
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C. THE FIFTEEN ITEMS WITH “0%” 
CHANGE IN THE HOSPITALIZED NEU- 
ROTICS (N = 24): 

Cg, Ci4, C46, C52, D21, D25, 043, E22, 
; H8, His, H20, Hee, Hey, H41. 

D. THE SEVEN ITEMS WITH "09%" CHANGE 
IN THE “NON-SHOCK” PSYCHOTICS 
(N = 18): 

E15, E20, E49, H38, Ig, J48, J51, 53. 

E. THE SIX ITEMS WITH ud NGE IN 
THE "SHOCKED" PSYCHOTICS (N — 20): 

Cii, Cig, C52, D28, Hey, Ig. 
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a 
A SCALE FOR MEASURING TEACHER-PUPIL ATTITUDES 
AND TEACHER-PUPIL RAPPORT! 


I. THE PROBLEM 


Г VIEW or the vast amount of educa- 
tional research carried on in recent 
years, it is rather surprising that one 
very important problem has almost 
escaped the critical attention of inves- 
tigators. This is the problem of relation- 
ship between teacher and pupil, or the 
more specific phase of this relationship 
as manifested in the personal interaction 
of teacher and pupil in the classroom. 
There are far too many classrooms today 
in which the social atmosphere is little 
above the level of barbarism with refer- 
ence to teacher-pupil interaction. Too 
often, pupils are treated with little re- 
spect, discipline problems rob the teacher 
of energy needed elsewhere, and class- 
rooms are governed by individuals who 
adjusted in personality. 
years undoubtedly have 
: n this prob- 
contributed 


are seriously mal 
Although recent 
witnessed a growing interest i 
lem, research workers have 
little toward its solution, presumably be- 
cause of the difficulty @ scientific ap- 
proach into the field would present. 
Modern educators and school adminis- 
trators are, in most Cases, fully cognizant 
of the problem, but a general feeling of 
helplessness with reference to its solution 
has prevented an energetic effort to 
remedy the undesirable situation. 
Granted the importance of the problem, 
it would scem that an instrument de- 
signed to measure certain aspects of 


a Ph.D. investigation соп- 
ity of Minnesota under the 
direction of Dr. Walter W. Cook, Professor of 


Education, To Dr. W- w. Cook and Dr. Т. W- 
McConnell the writer gratefully acknowledges his 
indebtedness for a challenging problem, m t E 
inspiration and helpful ideas so essential to an 
effective program of attack. 


AND ITS SETTING 


teacher-pupil interaction would prove of 
great value in the hands of educators 
and school administrators. 

The purpose of this study is to explore 
the possibility of constructing a measur- 
ing instrument that will aid in the dif- 
ferentiation of teachers who have or will 
have rapport with pupils, from those 
who do not or will not have such rap- 
port. It is postulated, of course, that rap- 
port between teacher and pupil consti- 
tutes one of the many factors essential 
to teaching success. Although only one 
of many such factors, it is assumed 
[fírther that it is one of the most im- 
portant. Rapport between teacher and 
pupil refers to a state of harmonious re- 
lationship characterized by mutual affec- 
tion, sympathy, understanding, and co- 
operative behavior. Definitely involved 
is the teacher's personality which, on 
assumption, either facilitates or checks 
the establishing of satisfactory rapport 
with pupils. Numerous studies have 
indicated the importance of personality 
to success in teaching. However, the 
present study is not directly concerned 
with prognosis of success in teaching, 
and the personality factor, affecting 
teacher-pupil rapport as it does, is as- 
sumed merely as one very important ele- 
ment in both the vocational life of the 
teacher and the learning career of the 
pupil. 

As attitudes апа feelings probably 
represent the core of human relation- 
ships in all of their manifestations and 
as the area covered by teacher-pupil re- 
lationships is quite large. consideration, 
in general, will be restricted to these re- 
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lations as exemplified in the attitudes 
and feelings each group holds toward the 
other. It will be assumed that a teacher's 
attitude toward pupils and toward chil- 
dren in general is an index to the rap- 
port he has or will have with them. 
Rapport between two people involves 
relationship in two directions. Not only, 
then, must we consider the teacher's atti- 
tude toward pupils, but also how the 
pupils feel toward the teacher whose 
attitude has been determined. In the 
present study the possibility is explored 
of validating the instrument for the 


measurement of a teacher's attitude by 
determining the attitude of the pupils 
toward that teacher. Thus, the reaction 
of the pupils toward each particular 
teacher in the study represents one of 
the criteria against which the present 
instrument is validated. Presumably, a 
relationship between teacher and pupil 
in which the teacher holds an unsatis- 
factory attitude would also be charac- 
terized by antagonism or resentment on 
the part of the pupil, At least, this repre- 
sents a possibility which this study 
attempts to explore. 


` 


‘I. THE CONSTRUZTION, ADMINISTRATION, AND SCORING 
OF THE TEACHER-PUPIL INVENTORY 


A. CONSTRUCTION OF THE ORIGINAL 
INVENTORY 
1. Nature of the Measuring Instrument 
iow “THURSTONE TECHNIQUE of scale 

AL construction (20) was rejected. prin- 
сіраПу because its rigid demand for 
linearity along a degree-of-favorableness 
continuum would seem to present an 
undesirable restriction and limitation. 
Furthermore, the use of judges or €x- 
perts (as in the Thurstone technique) 
Was not contemplated in the present 
study. 

Although the Likert technique. (11) 
was not adopted in its entirety, the type 
of instrument finally decided upon is 
similar in some respects to that of 
Likert’s, The multiple-choice mode of 
statements (each 
a scale in itself) 
of nu- 


response to opinion 
item thus approximating 
was employed. The assigning 
merical values, for scoring purposes to 
each of the several responses per item 
point of similarity, al- 


also represents а 
these 


though the method of weighting 
responses differs considerably from that 
used in the Likert technique. 

The instrument was to be known as 
the Teacher-Pupil Inventory. 


2. Construction of the Items 


Opinion statements were chosen in 


preference to concrete situations with 
multiple-choice reactions. It was felt that 
a relatively large number of opinion 
statements, touching upon à variety of 
topics, would tap more adequately а 
teacher's attitude toward pupils than 
would fewer and less flexible items of 
the other type. In making this choice, 
the investigator was fully aware of the 
discrepancy which so often appears be- 


tween opinion and overt action. Yet, in 
the appraisal of attitude, opinion is per- 
haps as valid an indicator as is potential 
overt behavior. 

The subject was to respond to each 
item by encircling a number represent- 
ing the degree of his acceptance as fol- 
lows: (1) strongly agree, (2) agree, (3) un- 
decided or uncertain, (4) disagree, (5) 
strongly disagree. The following exam- 
ple illustrates a "strongly agree" re- 
action: 


112. Immediate obedience is not always desirable. 
@ 2345 


An extensive survey of the literature 
was made to obtain a wealth of ideas 
concerning all phases of teacher-pupil 
relationship, both in and out of the 
classroom, to be used as source material 
for the building of items. These ideas, 
coupled with those from the writer's own 
provided the basis for the 
compilation of a list of 378 opinion state- 
ments relating to the reaction of teachers 
to children and pupils and their be- 
havior. Items expressing opinions con- 
cerning children in general were in- 
cluded along with those relating directly 
to pupils, as it was felt that a. teacher's 
reaction to children as such is of sig- 
nificance in her attitude toward pupils. 
Items were constructed with children of 
elementary school age primarily in mind. 
It was believed that the Inventory even- 
tually would prove most useful at this 


level. 


experience, 


3. Classification of the Items 

уроп that the employ- 
f classification in 
ld contribute to 
ling of teacher 


On the assum 
ment of some system О 
item construction wou 
a more adequate samp 
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attitude and behavior in relation to 


pupils, the opinion statements were 
classified under five categories which are 


presented here with brief explanatory 
comments and illustrative items: 


(1) Moral status. This phrase is meant to 
cover those adult-child relations essentially char- 
acterized by adult recognition or non-recogni- 
tion of child or pupil as an individual with his 
own inherent rights. Placed under this category 
were items that related to the status of children 
in the eyes of adults, especially as concerns their 


adherence to adult-imposed standards, moral and 
otherwise, 


Example: Children "should be seen and not 
heard." 


(2) Discipline, Under this caption were in- 
cluded those statements relating to problems of 
conduct in the classroom and elsewhere, and the 
methods employed in dealing with them. 


Example: Pupils found writing notes should 
be severely punished, 3 
ere placed those 
ples of child de- 


(3) Child knowledge. Here w 
statements relating to prin 
velopment and behavior ordinarily included in 
such subject fields as educational psychology, 
child Psychology, and mental hygiene. More 
specifically, the content of these items related to 
the topics of ability, achievement, learning, mo- 
tivation, and personality development, 

Example: The boastful child 
confident of his ability. 


is usually over- 


(4) Educational principles, Under this. cate- 
gory were placed iems pertaining to educational 
principles and practices not directly relating to 
child development and behavior. Problems in 
education concerning method, philosophy, cur- 


riculum, and administration. are touched upon 
here. 


Example: Pupils should be required to do 
more studying at home, 


(5) Personal reactions 
under this category were s 
lating to the teacher's pers 
likes and dislikes, 


of teacher. Included 
tatements directly re- 
onal disposition—her 
sources of irritation, etc, 


Example: Without children life would be 
dull. 


Consideration of the abovi 


tion will reveal a rough attempt to identify 
psychological factors in personality structure and 
function, as well as to make à differentiation 
relative to the ideational content of items, It 
was believed, for example, that the cat 
"Moral status" and 


€ system of classifica- 


egories 
"Personal reactions of 


teacher" related to responses that involved mno 
of the affective nature and less of the cognitive, 
than did the other categories. The feeling coms 
ponent in these areas would seem to be stronger 
than the intellectual clement. At least, the ue 
ject-matter that would make up the content > 
items under these two categories was less T 
jective and less well established as ео, 
principles. However, the composition of psyche 
logical reactions to statements of opinion, А 
expressed in terms of affective апа intellectua 
clements, is still essentially a matter of con- 
jecture. š 

Added to this uncertainty is the problem o 
overlapping among the categories. As the classi- 
fication of statements proceeded, the difficulty 
of the process became more and more apparent, 
with an increasing realization that a number of 
the items should be classified under two or more 
categories because of a very evident relevance 49 
each of several topics. The following item, for 
example, was classified under the categoric of 
"Moral status," "Discipline," and "Educational 
principles": 


Unquestioning obedience in a child is not 
desirable. 


And, who is to say that this item, with possibly 
equal justification, could not have been placet 
under the two remaining categories of "Child 
knowledge" and "Personal reactions of teacher? 

Such considerations as the above made the 
whole problem of item classification very difficult 


to handle, especially on a logical and an а 


TABLE т 
NUMBER OF [TEMS IN FINAL INVENTORY OF 
164 ITEMS CLASSIFIED UNDER EACH OF 
Five CATEGORIES 


Category No. of Items 
Moral status 110 
iscipline 43 
Child knowledge 43 
Educational principles 50 
Personal reactions of teacher 43 


priori basis. The classification process was com- 
pleted, however, as it was felt that it would lead 
to а more substantial b 
Table 1 indicates the 
final Inv. 


€ five categories. A number of 


the items were placed in more than one division. 


4. Two Forms of the Inventory 


То facilitate the selection of the best- 


Stated and most discriminating items, 
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two forms (A and B) of һе Teacher- 
Pupil Inventory were constructed. Form 
B was made up of a second list of 378 
items, or opinion statements, each of 
which (with the exception of 8 or 10 
items) was stated in a different manner 
(usually in reverse form) from a cor- 
respondingly numbered related item in 
the first list (Form A). Samples of several 
items from Form А with correlative 
items from Form B are given below. 

1. Most children are obedient. (Form А) 

1. Most children tend to be disobedient. (Form 

B) 
. Pupils expect too much help from the 

teacher in getting their lessons. (Form A) 


Teachers should give pupils more direct 
help with their lessons. (Form B) 


1 


г 


3. The majority of children take their respon- 
sibilities seriously. (Form A) 
3. Most children are irresponsible. (Form B) 


In order to make Form A and Form B 
as equivalent as possible, approximately 
50 per cent of the items in each form 
were so worded that agreement with 
these statements corresponded to the 
same end of the attitude continuum as 
did disagreement with the other 50 рет 
cent of the items. These two kinds of 
statements were distributed at random 
throughout each form. Items represent- 
ing the five categorical topics were also 
placed at random within each form. The 
number of items in each form (378) was 
made amply large to provide for the 
elimination of poor items. 


B. ADMINISTRATION ОЕ THE 
ORIGINAL INVENTORY 


1. Empirical Basis for Standardization 


judges or experts to 
lidity and quality, 
standardization, 


Instead of using 
rate the items as to va 
to establish a basis for 1 
and to devise a System of scoring, 1t was 
the plan of the investigation to arrive at 
these ends by empirical means. That 15, 


the plan was to administer the Inventory 
to two groups of teachers (one group 
characterized as maintaining good rela- 
tions with pupils, the other group as 
maintaining poor relations) to deter- 
mine the differences, if any, between the 
groups in their reaction to the items. In 
other words, instead of logically deter- 
mining in advance how certain teachers 
ought to react to the items and then 
seeing if they measure up to this stand- 
ard, the method of empiricism involves 
the experimental discovery of how 
various teachers actually do react to the 
items, which finding then serves as à 
basis for the interpretation of all subse- 
quent reactions. 

The criterion used in the original se- 
lection of the two groups of teachers 
Was the local school administrator’s judg- 
ment concerning the teacher-pupil rela- 
tionship status of his individual faculty 
members. In. employing this criterion, 
the investigator fully realized that ad- 
ministrators often are not as aware as 
they should be of the actual nature of 
the relationship existing between the 
pupils and individual members of the 
teaching staff. 


a. Nature of the Teacher Population 


The teacher population to which the 
original Teacher-Pupil Inventory was 
administered comprised two groups of 
teachers, 100 in each group, differen- 
tiated on the basis of the local school 
administrator's judgment concerning the 
individual teacher's ability to maintain 
harmonious relations with pupils. This 
was the only point of differentiation that 
was controlled in the selection of the two 
groups. Each group was heterogeneous 
in nature as there was no attempt made 
to control such factors as age, sex, mari- 
tal and parental status, nationality, 
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grade level, subject taught, training, and 
experience. Little consideration was 
given these factors at this stage of the 
investigation, as the essential focus of 
attention here was the difference in re- 
action to items of the Inventory between 
the two groups of teachers who had been 
differentiated solely on the basis of rela- 
tionship with pupils. The writer was 
aware of the possible influence of the 
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the investigators judgment they were, in general, 
educational institutions average in philosophy 
and practice, probably tending toward the tradi- 
tional and conservative type of school. 

Table 2 indicates the extent to which different- 
sized communities were represented in making 
up the two groups of 100 teachers cach. The 
representation, in each group, of the different 
school levels is also indicated. No effort was 
made to equate the two groups relative to the 
size of school system or to the grade level rep- 
resented. Teachers at each extreme of the teacher- 
pupil relationship continuum were used irre- 


TABLE 2 


NUMBER OF SCHOOLS AND TEACHERS REPRESENTIN 


DIFFERENT SCHOOL LEVELS FROM W 


S DIFFERENT-SIZED COMMUNITIES AND 


HICH Two Gnours ОЕ roo TEACHERS 


EACH WERE Drawn 


Superior Group 


Inferior Group 


Size of Community No. of 


No. of No. of No. of 
Schools Teachers Schools Teachers 

Cities with population of Sr. High 2 3 Sr. High I 4 

100,000 and over Jr. High I I Jr. High I 2 

Elem. 9 14 Elem. 8 16 

Cities with population be- Sr. High 9 2r Sr. High 8 20 

tween 1,000 and 100,000 Jr. High 5 10 Jr. High 4 9 

Elem. 17 3o Elem. 18 33 

County schools and towns High | 14* 9 High | тї* 9° 

with population of 1,000 Elem.f 12 Elem.f 7 
and under 

57 100 a Tos 


* Both elementary and secondary grades housed in one building. 


other factors upon teachers’ attitudes, 
but consideration of this problem was 
reserved until a later time. 

An effort was made to contact teachers 
from schools of varying size and descrip- 
tion so as to include individuals with 
varied teaching environments, However, 
the study was limited to teachers in the 
public schools at the elementary and 
high school levels. 


The schools included were located in Penn- 
sylvania and Ohio and ranged in size from one- 
room county schools to the elementary and high 
schools of one of our larger cities, The schools 
represented farming communities, industrial апа 
mining regions, mercantile areas, and wealthy 
residential districts. None of the schools was 
extremely progressive or ultraconservative, In 


Spective of where they were found. Reference 
to Table 2, however, will reveal a close similarity 
In composition of the two groups. 


8. Pe 


By means of a Form for Personal and 
Descriptive Data which accompanied the 
Inventory, а record was kept, 
teacher, of the factors of age, sex 
and parental Status, nationality, grade 
level, subject taught, training, and ex- 
perience. The teacher was also asked 
(а) to indicate whether or not he had had 
а course in menta] hygiene, and (b) to 
rate himself relative to his liking for 
teaching. The name of the teacher was 
not requested. It was felt that anonymous 


rsonal and Descriptive Data 


for each 
, marital 
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responses would approximate more close- 
ly actual attitudes. 


4. Explanation and Directions for 
Completing the Inventory 


Explanation and instructions relative 
to completing the Inyentory were as 
follows: 


Explanation and Directions 


There is considerable disagreement in educa- 
tional circles about a variety of topics involving 
the relationships of teachers and pupils. These 
topics are still largely matters of opinion with 
little reénforcement from scientific evidence. It 
is the purpose of the Inventory to sample exist- 
ing opinion in the area of teacher-pupil rela- 
tionships. There are no right or wrong answers 
to the items of the Inventory. Your own individ- 
ual feeling about the matter is what is wanted. 
A frank and sincere response to each item 15 
obviously necessary if this study is to serve its 
purpose. Your responses will have no effect 
whatever upon your personal or professional 
standing. Your name is nol requested. 
Directions: After reading the item. carefully, 
indicate your response by encircling the number 
which most nearly represents your reaction to 
the topic in question. Think in terms of general 
situations rather than of individual cases. Please 
respond to every item. The scale is as follows: 


1. Strongly agree 


2. Agree 

з. Undecided or uncertain 
4 

5 


. Disagree 
. Strongly disagree 


5. Distribution of the Inventories 


Before the teachers were approach 
permission to carry ОП the investigation 
in the particular school system and 
building was obtained from the superin- 
tendent and principal, respectively, and 
their cooperation Was assured. The pur- 
pose and methodology of the study were 
carefully explained to them. The school 
official feeling most qualified (in most 
cases this was the principal) was asked to 
list, if possible, several teachers on his 
staff whom he considered quite superior 
in their relationships with pupils and 
several at the opposite extreme, whom 


ached, 


he considered quite inferior in this re- 
spect. In making this selection, he was 
to consider the following criteria with. 
reference to each teacher: 
(1) Ability to win the affection of his pupils. 
(2) Fondness for, and understanding of chil- 


dren. 
(3) Ability to maintain a desirable form of 


discipline. 

Although the characteristics above are 
not necessarily correlative, the school 
official generally experienced little diffi- 
culty in making the selection desired. It 
was made clear to him that, among а 
number of factors that go to make up 
the professional requirements of а 
teacher, relationship with pupils was the 
only one that he was to consider in his 
selection. Such factors as intelligence, 
teaching methods, and knowledge of 
subject-matter were not to be used as 
differentiating criteria. 

The investigator presented a copy of 
Form A of the original Inventory to 
each of the зоо teachers in person. 
Each was asked if he would cooperate in 
a research project by completing the In- 
ventory sometime within the next sev- 
eral weeks, at the end of which time the 
writer would return [or it. Each teacher 
was assured that no one but the writer 
would have access to the completed In- 
ventory. It was made clew that the 
project was the writer's orly, and that 
the school system or administration had 
had no part їп the instigation of the 
study. Although there were a few teach- 
ers who did not wish to participate, со- 
ration, for the most part, was excel- 


ope 
lent. For purposes of identification, a 
record was kept of the number of the 


copy that was given to each teacher? 


? Although. consuming much time and effort, 
this individual, personal contact Was considered 
essential to the Success of the investigation. 

2 This number had been placed on the Inven- 
tory quite inconspicuously (immediately follow- 
ing the form designation; e. Form Аб). 
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Thus, from none to five or six teachers 
at each extreme (depending, among 
other factors, upon the size of the teach- 
ing staff) were located in each of sixty- 
seven buildings that were visited in four- 
teen school systems. In this manner there 
were procured 100 subjects in each of 
two extreme groups—one group com- 
posed of teachers who were considerably 
above average in their ability to get 
along with pupils (to be known as the 
“superior” group), and the other group 
consisting of teachers considerably below 
average in this ability (to be known as 
the "inferior" group). 

After a month or so had elapsed, the 
writer revisited the sixty-seven schools 
and again saw, individually, each of the 
200 teachers. At this time Form A of the 
Inventory was taken up and a copy of 
Form B, with a return envelope, was left 
with each teacher, For purposes of 
identification care was taken that the 
number on the copy of Form B given to 
any one individual corresponded with 
that on his copy of Form A. 


C. Comparison OF RESPONSES AND 
SELECTION or ITEMS 


As soon as the Inventories were com- 
pleted and returned, the response of 
each of the 200 teachers to each of the 
378 items in each of the two forms of the 
Inventory was carefully recorded. The 


TEACHERS TO ITEM r (Form 


purpose of tliis work was to com pare the 
responses of the two groups of teachers 
to each of the items in order to deter- 
mine possible differences in reaction. 
Items that did not differentiate the 
groups could then be discarded and a 
scoring system devised for the remaining 
items on the basis of the obtained dif- 
ferential responses. 


1. Comparison of Responses to the Items 


The frequency of cach of the five re- 
Sponses (1, 2, 3, 4, 5) for each item was 
obtained, separately, for each group. As 
there were 100 individuals in each group, 
the frequencies also took the form of per- 
centages. As illustrated in Table $a 
table showing the pattern of response 
Was constructed for each of the 756 items 
(Forms A and В). 


The response pattern in Table 3 is inter- 
preted as follows: 174, Or 87 per cent, of the 200 
teachers are in agreement with this item (in- 
cluding both the “strongly agree" and "agree" 
responses). Thirty-four рег cent of the superior 
group "strongly agree"; 18 per cent of the in- 
ferior Broup "strongly agree”; 3 per cent of the 
superior group "disagree," whereas 15 per cent 
of the inferior Broup "disagree"; the same per- 
centage of each Broup are "undecided" (4 per 
cent); the same percentage "strongly disagree" 
(1 per cent). It is of interest to note that a 
larger percentage of the inferior Broup (64 per 
٤ perior Broup (58 per cent) 
agree” to the item, This apparent discrepancy 
5 by the fact that a rela- 
tively larger Proportion of the Superior group is 

“strongly agree” column. This ex- 


UPERIOR AND INFERIOR Group: 
A) 


1. Most children are obedient, 


Strongly 
ч Strongl 
Agree Agree Undecided Disagree Disagree 
Group I 2 3 4 5 
Superior 34 58 
Inferior 18 EA 4 3 : 
ї 
Difference 16 —6 o то 
_ ° 


ر 
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2 TABLE 4 


DISTRIBUTION OF RESPONSES OF THE SUPERIOR AND INFERIOR GROUPS OF 
TEACHERS TO ITEM 243 (For A) 


s in amusing classroom situations, the class tends to get 


243. If the teacher laughs with the pupil 


out of control. 
Strongly А б Strongly 
Agree Agree Undecided Disagree. Disagree 
Group H 2 3 4 5 
| Superior 1 o H 55 43 
Inferior 1 4 6 67 22 
Difference o —4 —$ —12 21 


other words, one might characterize their re- 
action as one of nominal disagreement, or às 
representing the most widely accepted opinion, 
whereas the reaction of the superior teachers is 
strong enough to indicate an attitude of suffi- 
cient potency to effectively influence their daily 
professional work. With a number of items, in 


planation became more and more plausible as 
the reactions to other items were studied. Con- 
in Table 4 as a second 


sider the response pattern 
example. Over go per cent of the teachers do 
not agree with this item. The most differentiat- 

the groups is No. 5 


ing response between 
(“strongly disagree”). It seems reasonable to 


| 


TABLE 5 
DISTRIBUTION OF RESPONSES OF THE SypERIOR AND INFERIOR GROUPS OF 
TEACHERS TO ITEM 290 (Form A) 
l * 
29o. Most pupils don't appreciate what a teacher does for them. 
Strongly Y Ü Strongly 
Agree Agree Undecided Disagree Disagree 
Group I 2 3 4 5 
| Superior o 38 16 39 7 
Inferior 7 49 15 29 Š 
| Difference EY = т >a N 
I relatively high percentage (67 per this manner, the strength of differentiation lay 
: in the difference in response of the groups rela- 


or group found under Ке- e ' 
general disagreement E reaction at either 
rs of this group COM 
arent unwilling- 


e in their response. In 


tive to the two degrees o 
end of the attitude continuum. 

B Y The responses indicated in Table 5 assume a 
cerning: Ше iter» different pattern. The responses for this item 
ness to go to the extrem 


TABLE 6 
N or RESPONSES or THE SUPERIOR AND IxrERIOR GROUPS OF 
BiH TEACHERS TO ITEM 164 (FORM B) 
| ils t have much use for teachers. 
| 164. As a rule pupils Чо | | — 
Agree Agree Undecided Disagree Disagree 
Group 1 2 5 4 5 
67 25 
Superior I $ E 6t T 
Inferior 
— —12 6 9 
Difference o 3 


| 
| 
| 
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assume a rather striking pattern of symmetry. A 
larger proportion of the inferior group, how- 
ever, are in agreement with the item. It is of 
interest to note that the same number (7) of 
inferior teachers "strongly agreed" to the item 
as superior teachers who “strongly disagreed.” 
There is also a remarkably close agreement be- 
tween the groups relative. to Response No. 3 
(“undecided”). This similarity of response in the 
“undecided” category is characteristic of a num- 
ber of the items. 

There are other items, however, with which 
considerable differentiation between the groups 
occurs in the “undecided” response. Table 6 
illustrates an item in which most of the differen- 
tiation occurs in this category. Although with 
this item 84.5 per cent of the 200 teachers are 
in disagreement, undecidedness seems to charac- 
terize the inferior group more than it does the 
other group. 


2. Selection of Items for the 
Final Inventory 


Only 164 of the original 756 items 
were chosen for use in the final Inventor}, 
the remaining items being rejected. The 
criteria employed in the selection and 
rejection of items were as follows: 

(1) Was the item adequate in differentiating 
the two groups of teachers? 

(2) Was the item ambiguous in meaning, lack- 
ing in clearness, or poorly stated? 


(3) Did the content of an item duplicate that 
of another item that had been selected? 


(4) Did the item show a response pattern that 
was difficult to interpret? 


The statistic, chi-square, was employed 
as an objective measure of the degree of 
differentiation between two response 
patterns. The following formula @ 
рр. 385-386) was used to calculate the 
contribution to chi-square of any given 
pair of responses: 


"WW (&N' — a'N}? where, 
а 


a and a’ represent the obtained column entries 


and N and N’ represent the corresponding row 
totals. 


As a valid and practical measure of the degree 
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of differentiation; between two response patterns, 
chi-square was found to have its limitations. 
Large chi-square values were sometimes the re- 
sult of differences in response that were found 
in two categories only (of which one was often 
the “undecided” response) and of differences in 
response that were irregularly distributed 
throughout the attitude continuum, thereby 
making interpretation difficult. A few items with 
relatively high chi-square values were, conse- 
quently, rejected for use in the final Inventory. 
On the other hand, there were a few items with 
relatively low chi-square values which were se- 
lected for use because of their symmetrical re- 
sponse patterns or because their inclusion in 
the Inventory would lead to a richer and more 
diversified body of content for the instrument. 


The value of chi-square proved to be 
an effective criterion for item selection 
provided the remaining three criteria 
were satisfied. Generally speaking, an 
item (even though possessing a high chi- 
square value) was discarded if it did not 
meet the demands of all three remaining 
criteria. 


Of the 164 items chosen for use in the 
final Inventory: 


79 (48 per cent) met the probability level of 
:05 (x? = 9.488) 

112 (68 per cent 
по(у* = 7.779) 

134 (82 per cent) met the probability level of 
+20 (x? = 5.989) 

146 (89 per cent) met the probability level of 
130052 = 4.878). 


157 (96 per cent) met the probability level of 
БО (у? = 3.357) 


) met the probability level of 


It was in the original plan of the in- 
vestigation to so construct the Teacher- 
Pupil Inventory that, in its final state, it 
would contain from 150 to 175 items in 
each of two forms, As analysis of items 
progressed, however, it became increas- 
ingly apparent that the final instrument 
would be more satisfactory if it consisted 
of one form with a more select group of 
items than of two forms with inferior 
items. The revised Inventory, then, is- 


И 
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š — 9 
sued in one form containing 164 of the 
most select items. 


D. Tur ORIGINAL SCORING SYSTEM 
FOR THE INVENTORY 


The simplified scoring system of 
Likert (use of consecutive numerical 
values 1, 2, 9, 4, 5, assigned to the several 
Tesponses) had been contemplated for 
Use with the present instrument. When 
responses to the items were analyzed, 
however, the idea of using this technique 
Was abandoned. It became increasingly 
evident that, in the great majority of 
Cases, the responses for each item were 
not normally distributed. Rough inspec- 
tion revealed tendencies toward distribu- 
tions bi-modal in character, or with a 
Piling up of responses at one or the 
Other end of the attitude continuum. 
Furthermore, no assumption was to be 
Made relative to uniformly—or non- 
Uniformly—spaced "steps" along the atti- 
tude continuum. 


1. Procedure Used in Calculation of 
Weights for Item Responses 


Consideration of such factors as the 
above, coupled with the desire to make 
the investigation as empirical as possible, 
led to the adoption of a procedure for 
Weighting the several responses for each 
item in accordance with the diferentia- 
tion empirically established between the 
two groups of teachers studied. This was 
the method employed by Strong (17) in 
Weighting the responses to the items of 
his Vocational Interest Inventory. 
Strong's formula, contributed by T- Ly 
Kelley (17, pp. 603-615), was applied to 
the construction of the scoring key for 
the Teacher-Pupil Inventory. The range 
Of weights used extended from plus 4 to 


minus 4. Weights beyond these limits 
were valued at plus 4 and minus 4, re- 
spectively. For each of the five responses 
associated with each of the 164 items in 
the revised Inventory the formula was 
used to compute the weight to be 
assigned to that response. The value of 
W obtained by use of the formula is 
multiplied by 100, and the nearest whole 
number is used. 


E. THE SIMPLIFIED SCORING SYSTEM 
FOR THE INVENTORY 


The system of scoring indicated above, 
with weights ranging from plus 4 to 
minus 4 assigned to the several responses, 
necessitates the rather cumbersome 
proces of adding algebraically a fairly 
long list of positive and negative num- 
bers. Later in the study it was found 
that results produced by a scoring scheme 
which allotted plus 1 to each circled re- 
sponse with a positive weight (disregard- 
ing all circled responses with negative 
and zero weights) correlated .973 with 
results attained by the longer method. 
This simpler scoring method was there- 
fore later adopted. 


Е. COMPARISON OF INVENTORY SCORES OF 
THE SUPERIOR AND INFERIOR GROUPS 
or TEACHERS 


After the original scoring key had 
been established, it was used in the scor- 
ing of the 200 inventories received from 
the two groups of teachers. Forms A and 
B were now combined, and, in the scor- 
ing, consideration was limited to those 
164 items which had been selected for 
the final form of the Inventory. Results 
obtained in' comparing the scores of the 
superior and inferior groups, of teachers 


12 CARROLL H. LEEDS 


TABLE 7 the highest score (324) was obtained by 

RESULTS OBTAINED IN THE COMPARISON OF a Dean of Women in a high school and а 

Merl enum m e score fifth from the top (295) was made 
TEACHERS 


. by a junior high school teacher who also 
1 š . ERO 
Superi Inferior was serving in the capacity of visiting 
isti uperior nieri | A à 
E Group Group teacher. These were the only two indi 


N nae viduals in the sample occupying such 
` IOO t H 

Range 324—(-99) 213—(—297) unique professional status. 

M 131.0 —32.0 

S.D. 102.91 112.46 1. Distribution of Inventory Scores for the 

Км EO 11.18 Superior and Inferior Groups of Teachers 

Mdn 130.0 —41.5 — 
The statistics g, and g, suggested by Fi Pa 

«Ом 15.13 (15) as appropriate tests for the normality ee 

Dx 163.01 


2. " : vo 
CR 8 distribution, were calculated for each of the tW 
SEE d sets of scores. The quantity g, is a measure 9, 
skewness; the closer its value to zero, the ms 
1 ric istri i The negative và 
are presented in Table 7. It may be seen symmetrical the distribution. The negat! 


: wn 
; А N of g, obtained for the superior group, as sho 

oe " up -i жэй: au an 

that, despite some overlapping, there iS jn Table 8, indicates a slight asymmetry with 27 


5 сый š y m si 
a highly significant difference between excess of scores larger than the mean. The p? 


! TABLE 8 


VALUE AND SIGNIFICANCE OF £i AND g2 FOR THE DISTRIBUTIONS OF INVENTORY SCORES MADE 
BY THE SUPERIOR AND INFERIOR GROUPS OF TEACHERS 


ae 
Probability Probability 
NET Š » | of toy e lo; | of tos 
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the means of the two groups of teachers tive value of g, found for the inferior 
(critical ratio: 10.8), Although not shown indicates an excess of scores smaller theo 
: . . ү! [ ү: iş 100 
in the table, 10 of the scores of the in. 120: However, each of the g, values В 

ade to be significant. 
ferior group were above the mean of the 


NE SU 
j The statistic g, is a measure of kurtosis. п r^ 
superior group, and only 4 of the scores value is zero, normality is indicated 59 4 
of the latter group were below the mean his measure is concerned. The negative value ү 
of the inferior group. Also, it is of inter. f obtained for the two distributions TE 15) 
est to note that, in the superior Ei channan ci qiie i 8 
3 Superior group, g, values is too small to be significant. 
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III. THE VALIDITY AND 


RELIABILITY OF THE 


TEACHER-PUPII. INVENTORY 


LTHOUGH SUBJECTIVE in nature, the 
process of composing the items 
contributed to the validity of the instru- 
ment. In a more objective manner the 
administration of the Inventory to two 
groups of teachers at extreme ends of 
an attitude-toward-pupils continuum 
constituted a validating process in estab- 
lishing the validity of individual items. 
Other criteria being satisfied, only items 
that served to differentiate the groups 
were selected for use in the final or re- 
vised instrument. The statistic, chi- 
square, served as the measure of the de- 
grce of differentiation. 
It now remained 
validity of the Inventory as a whole. 


to establish the 


A. DETERMINING THE VALIDITY OF 
THE ÍNVENTORY 


1. Plan of Procedure 

The plan of determining the validity 
of the Inventory essentially involved 
(а) its administration to a new sample of 
100 teachers, (b) the obtaining of ratings 
on these teachers with reference to their 
relationships with pupils, and (c) the cor- 
relation of the teachers’ scores on the 
Inventory with the ratings. These ratings 
were to be obtained from each of three 
Sources: (a) principals under whom the 
teachers were serving, (2) classroom ob- 
servation on the part of the investigator, 
and (c) attitudes of pupils toward each 
individual teacher. 

As a further check on the validity of 
the items of the Inventory, out of the 
new distribution of 100 teachers the 
item response patterns of the twenty 
with the highest Inventory scores and 
the response patterns of the twenty with 
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the lowest scores were to be compared, 
respectively, with the patterns of item 
response characterizing the original 
groups of superior and inferior teachers. 
Also, the responses of the twenty teach- 
ers with the highest scores were to be 
compared with the responses of the 
twenty with the lowest scores. The sta- 
tistic, chi-square, was to be employed as 
the objective means of comparison. 


ю 


. Administration of the 
Revised Inventory 


a. The nature of the teacher popula- 
tion. Inasmuch as the original sample of 
зоо ‘teachers represented the two ex- 
tremes in teacher-pupil relationship, it 
was deemed desirable to include in the 
new sample 100 teachers unselected with 
reference to their relationship with 
pupils. In designation, this group will be 
known as the “unselected” group. 

The 100 teachers who were to serve as 
subjects were fourth-, fifth-, and sixth- 
grade teachers in three different school 
systems, two of which had been used in 
the earlier phase of the investigation, In 
these two systems, however, buildings dif- 
ferent from those used previously were 
visited at this time. The one condition 
under which the 100 teachers were se- 
lected, aside from their status as public 
school employees, was that they have 
teaching contact with children in the 
fourth, fifth, or sixth grade at the time 
of testing. It was immaterial as to 
whether they were teaching one or more 
of these grades. Table 9 indicates the- 
number of cities, schools, and teachers 
representing the different-sized сот- 
munities from which the unselected 
group of 100 teachers was drawn. 
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TABLE 9 


NUMBER or CITIES, SCHOOLS, AND ‘TEACHERS 
REPRESENTING DIFFERENT-SIZED 
COMMUNITIES FROM WHICH THE 

UNSELECTED GROUP OF 100 
TEACHERS Was DRAWN 


No.of No.of No. of 


Community Size Cities Schools Teachers 


Cities with population 


of 100,000 and over т 5 46 
Cities with population 
between 1,000 and 

100,000 2 13 54 

Total 3 18 100 


It was planned to use teachers of the elemen- 
tary intermediate grades for two fundamental 
reasons: (a) Since the inception of the study, it 
had been the investigator’s conviction that the 
prospective Inventory would prove of particular 
value at this important rung of the educational 
ladder. For where else than in the elementary 
grades is the matter of teacher-pupil relation- 
ship of such peculiar importance? Consequently, 
items were constructed with children of ele- 
mentary school age primarily їп mind. (b) It 
was believed that the attitudes of pupils toward 
their teachers (an important validating device 
in the study) could be obtained more effectively 
at the intermediate grade level. Pupils of grades 
four, five, and six would be of sufficient maturity 
to respond adequately to simple items in a ques- 
tionnaire. Furthermore, it was felt that responses 
obtained from these pupils would be of a more 
ingenuous and less sophisticated nature than 
those that would be procured from pupils more 
advanced in educational achievement, who have 
far too often, become comfortably adapted to the 
conventional school situation. 


b. Mechanics of administration. As in 
the previous visitation of schools, the ini- 
tial step at this time was to explain the 
nature of tbe investigation to the admin- 
istrative head of the particular school 
system and obtain the assurance of his 
cooperation in the study. After then in- 
terviewing the principals of the several 
buildings that were to be used and ex- 
plaining the purpose and method of the 
investigation, the writer contacted all 
teachers at the fourth-, fifth-, and sixth- 
grade levels personally and individually. 
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Unless the teacher preferred not to pat 
ticipate in the study, a copy of the re- 
vised Inventory, together with a stamped 
return envelope, was left with him.! 


3. Obtaining the Ratings of the Teachers 


Care was taken that no attempt was 
made to secure further data until all 
copies of the Inventory distributed to the 
teachers had been completed and re 
turned. This precaution was considered 
essential, for any awareness on the part 
of the teacher that he was to be rated 
probably would have influenced his re- 
sponses to the items in the Inventory. on 
the other hand, to avoid bias in the min 
of the investigator in his ratings Q 
teacher behavior, scoring of the Triveni 
tories was deferred until all data relative 
to rating of the teachers had been 
gathered. 

a. Ratings by principal. A rating scale 
was devised on which a principal was io 
rate each of the teacher-subjects 0n his 
staff with reference to relationship with 
pupils. Ratings were to be obtained for 
each teacher on the following aspects 0 
this relationship: 

(1) Disciplinary ability. 


“ " “ya „ point 0Ё 
(2) "Personnel" vs, "subject-matter" point 
view, 


(3) Attitude toward children. 


(4) Understanding of pupil behavior prob: 
lems. 


(5) Personality adjustment, 
(6) Attitude of pupils toward this teacher- 


It would seem that the above ite™ 
are important factors in the person? 
interaction between teacher and рир!’ 
This personal interaction is the aspect 
of the whole teacher-pupil relationshiP 
pattern that is of particular interest in 
the present study. The topics mentionec 


` The Inventor 


š š am 
copies again had been nu 
bered to make іде 


ntification possible. 
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represent several basic dx of teacher- 
pupil relationship from which a number 
of the items of the Inventory were 
drawn, 


The discipline that a teacher maintains in the 
classroom essentially is determined by the per- 
Sonal relationship that exists between him and 
the pupils. Whether it is the tyrannical teacher 
Who “rules with an iron hand" and instills fear 
into his pupils or the supine teacher who is 
unable to “keep order” and whose pupils have 
No respect for him, the problem is principally 
опе of affective and personal relationships. 

The personal interaction between teacher and 
Pupil probably is influenced by the teacher's 
©wn philosophy of education. At one extreme he 
may look upon the child merely as a passive 
receptacle for knowledge and dogmatically cling 
to "blanket" standards of achievement. Or, on 
the other hand, he may regard the mastery of 
Subject-matter as secondary and actively strive 
to meet the varied needs of the child's develop- 
ing personality. 

Item number g in the Rating 
With the teacher's attitude toward children, goes 
Tight to the heart of the problem with which the 
Inventory was designed to cope. Does the teacher 
look upon the child as a sinister and inferior 
being whose greatest virtue is to obey his elders? 
Or, at the other extreme, does the teacher nat- 
“rally love children, and, sympathetically and 
"nderstandingly, appreciate their individual 
Problems of growing up? 

Closely related to a te 
Children is his understan 
Problems, In his reactions tow ) 
Children does he indicate an understanding of, 
and an attempt to apply, the principles voiced 
by mental hygienists? For example, does he con- 
Sider the behavior of the shy, withdrawing child 
35 probably more serious than the aggressive 
transgression against authority? Does he actively 
Seek to determine the causes behind misconduct, 
OF is he satisfied with the consideration of symp- 
toms only? 
ee unquestioned influenc 
M interaction with pupt t or 
m personality adjustment. It is a strong 3 

tion of the writer that teachers with numer 
us disciplinary problems, teachers. who are 
irritated easily by children's actions, and teachers 
Who are often uneasy in the presence of children 
lity difficulties of. their 
heir effectiveness 1n the 


Scale, concerned 


acher's attitude toward 
ding of child behavior 
ard the behavior of 


ce in a teacher's per- 
15 is the status of his 


oh, beset with persona 
Scho, Which undermine t 
Olroom. В 
Ehe attitude of the pupils toward a teacher 15 
tho ay sensed by the principal. Especially is 
1S true with a very popular teacher or wit 


one who is regarded as an enemy. Аз indicated 
elsewhere in this paper, the pupils’ liking or 
disliking for a teacher is considered of basic sig- 
nificance in teacher-pupil relationship. 


When all of the inventories had been 
returned, copies of the rating scale de- 
scribed above, together with mailing fa- 
cilities for their return, were left with 
each of the twelve principals in the 
eighteen buildings visited in this part of 
the investigation. The discrepancy be- 
tween the numbers of principals and 
buildings was due to the fact that, in two 
of the school systems, each principal had 
charge of two or three buildings. 

The mean of the principals’ ratings 
was 5.62 with a range of scores extending 
from 14 (perfect score) to —9 (—14 being 
the Lowest possible score). 

b. Classroom observation. The next 
process in determining the validity of 
the Inventory involved the writer's ob- 
servation of the classroom behavior of 
each of the 100 teachers relative to his 
relationship with pupils. As an aid in 
this observation, use was made of Bax- 
ter's Rating Scale of the Teacher's Per- 
sonal Effectiveness (1), modified so as to 
meet more adequately the needs of the 
present study. Baxter's Scale was devised 
with the principal object of evaluating 
a teacher's personal effectiveness in the 
classroom by considering the pupils’ re- 
sponses to his conduct. This scale was 
modified for use in the present investiga- 
tion by (a) omitting the responses of the 
pupils, (0) omitting two of Baxter's 
items which were somewhat irrelevant to 
the present study, (c) adding an item re- 
lating to the teacher's disciplinary abil- 
ity, and (d) slightly altering the wording 
of Baxter's items in several places. It was 
believed that such modification would 
litate the evaluation of each teacher's 


faci 
nal reactions toward the pupils. 


perso 
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The writer spent from twenty minutes 
to one hour in the classroom of each 
teacher to observe his effectiveness in the 
following: 

(1) Maintaining discipline. 

(2) Creating a friendly classroom atmosphere. 

(3) Establishing a feeling of security. 

(4) Exerting a stabilizing influence. 

(5) Developing pupil self-reliance. 


The significance of the first two points above 
in teacher-pupil relationships is evident. Also 
important in the personal interaction between 
teacher and pupil is the feeling of security that 
a teacher is able to establish in his pupils. He 
may be encouraging and able to inspire con- 
fidence, or he may be intolerant of mistakes 
and definitely lacking in appreciation of the 
pupils’ efforts. 

Effectiveness in “exerting a stabilizing influ- 
ence" essentially depends upon the teacher's 
own personality adjustment. The importance of 
this problem in teacher-pupil relationship can- 
not be over-emphasized. 0 

A teacher's skill in developing pupil self- 
reliance is an important factor in his personal 
relationships with pupils. On the one hand, he 
may encourage pupil decision; on the other, he 
may demand rigid conformity. 

The rating of each teacher relative to 
the above points provided a numerical 
estimate of his personal effectiveness 
with the pupils which served as one cri- 
terion in establishing the validity of the 
Inventory. The mean of the classroom 
Observation scores was 3.60, and the 
range of scores extended from 10 to —9 
(possible range from 10 to — 10). 

c. Attitudes of pupils. 'The third phase 
in determining the validity of the Inven- 
tory involved procuring the reactions of 
pupils to each of the 100 teachers. Of the 
three validating criteria, that considered 
of most importance was the attitudes of 
pupils toward their teachers. Who is in a 
better position to decide what a pupil 
thinks about his teacher than the pupil 
himself who is with him so much of each 
school day? Or, isn't what a pupil thinks 
about the teacher of much importance? 
A positive answer to this question repre- 


sents a basic presupposition of the 
present study. Even though children are 
immature, when gauged by the standards 
of adults, and even though their opinions 
of teachers may change considerably as 
they grow older, it would seem that it 15 
their present attitude, infantile as it may 
seem, that should be of greatest signifi- 
cance to the educator. 

The measuring device constructed 19 
procure an estimation of the pupils’ atti- 
tudes toward the teacher took the form 
of а goitem questionnaire entitled My 
Teacher? The items were simple ае 
tions adapted to the mental maturity 
and experience of fourth-, fifth-, and 
sixth-grade children, that could be am 
swered simply by underlining “Yes, 
“No,” or “?.” 

The questions were formulated by the 
writer on the basis of ideas gleaned fem 
the literature on teacher-pupil relation” 
ships. The content of the questions їс 
lated to the teacher's disposition, his 
treatment of pupils, his teaching, his 
interest in children's activities, his status 
in the pupils’ esteem, his sense of humor, 
and a number of other topics within the 
realm of teacher-pupil relationship. The 
list of questions was made long enough 
to exhaust the source material for item 
content, yet short enough that comple 
tion of the questionnaire would not ta* 
the interest of a pupil in the inter- 
mediate grades. The following questions 
are cited as examples: 


4. Does this teacher scold the 
pupils a lot? 

6. Does this teacher explain the 
School work so that you can 
understand it? Yes No 

11. I5 this teacher often "bossy"? Yes No 


Yes No 


# їп 
*A copy of My Teacher may be found ! 
Appendix D in the writer's PhD. dissertation 


(10) on file in the University of Minnesota L4 
brary. 


x 


= 
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38. Does this teacher give you a and directed to go to the room where the rating 
chance to ask questions? Yes No ? was being done. 
49. Do you like this teacher? Yes No ? The teacher who was being rated was never 


Space was provided on the question- 
naire for the pupil to comment on (a) 
Why I Like This Teacher, and (b) Why 
I Don't Like This Teacher. The pupil 
was also asked to indicate the grade or 
mark which he usually received from this 
teacher. 

As a safeguard against disturbing 
morale, classroom observation on the 
part of the writer was completed before 
Pupil reactions were obtained. 


In procuring these reactions, the general plan 
Was to administer the questionnaire, in person, 
to Broups of about 25 fourth-, fifth-, or sixth- 
grade pupils. As these grades were departmental- 
ized, in whole or in part, in two of the three 
School systems used in this phase of the study, 
the majority of the pupils had classroom contact 
With several teachers during the school day. For 
Practical reasons, in some cases a group of pupils 
rated only one teacher, whereas in other cases, 
а group rated two, or possibly three, teachers. 
However, only опе teacher was rated during 
any one administration of the questionnaire. 

condition that had to be met was that any 
teacher being rated by a group must have teach- 
Ng contact, that school year, with the pupils 
In the group. 

All the data for the investigation were gath- 
Cred during the second half of the school year, 
at which time it was felt that teachers and pu- 
Pils had had ample opportunity for mutual ac- 
quaintance, 

To make conditions as uni 
Pupils were not asked to rate 
teachers, There is reason to believe š 
řoom teacher occupies а somewhat diflerent 
Status in the thoughts and feeling of her home- 
room pupils from that of a teacher not serving 
n this capacity, е š 

Each administration of the questionnaire 

there were ioo separate administrations 11 all) 
required from fifteen to twenty minutes. Groups 
ranged in size from twenty to forty pupils, 
usually numbering about twenty-five, In some 

"ildings it was the wish of the administrator 

at the writer go from room to room and use 
‘he whole class for each separate rating. In 
Others it was considered desirable that the rae 
i done in one room. In the latter an вв 
onc Di-five pupils were selected at rando UAI 

* Of the intermediate grades by the princip 


form as possible, 
their home-room 
that a home- 


present during the administration of the ques- 
tionnaire. In fact, no one was present other than 
the pupils doing the rating and the writer. 
The principal made the following points clear 
to each group of pupils: (а) No one, including 
the teachers and the principal, would ever know 
how they (the individual pupils) answered the 
questions, for no one, except the writer, would 
ever see the papers; (b) the way the questions 
were answered would have no effect whatever 
upon their grades or school standing. In the 
interest of morale, the writer had also requested 
each principal to ask the pupils not to discuss 
the questions afterwards with teachers or other 
pupils. After these remarks, the principal left 
the room. The writer then stressed the de- 
sirability of honest answers, and re-émphasized 
the fact that he would be the only one that 
would ever see the papers, and that even he 
would not know how any individual pupil an- 
swered the questions as no names were to be 
signed’ The writer then read the questions over 
with the pupils, making explanatory comments, 
and answering any questions that were asked. 


The mean of the distribution of scores 
on the pupils’ questionnaires was 20.0, 
the score range extending from 41 to 
—17 (possible range from 49 to —49)- 

d. The reliability of the ratings. The 
reliability of ratings is usually deter- 
mined by correlating the ratings made 
by two comparable judges. In the pres- 
ent study, the use of this procedure did 
not appear feasible or practicable, par- 
ticularly in connection with the ratings 
made by the principals and the writer. 
The method presupposes use of the same 
scale by both judges, whereas the rating 
scale used by the principals was different 
from the one used by the writer. Differ- 
ent scales were purposely employed. It 
was believed that a more extended and 
diversified list of traits to be rated, made 
possible by two different scales, would 
contribute definitely toward the validity 
of the ratings. Even if the same scale had 
n used by both principal and writer, 
elation of the two ratings in deter- 


bee 
corr 
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mining the reliability of the instrument 
would have been questionable procedure 
in that the two judges would not have 
been "comparable" because of differences 
in background, mental set, and knowl- 
edge of the teacher being rated. The 
same problem would have arisen in any 
attempt to employ a third party to do 
the rating in the capacity of a second 
judge. 

The reliability of the ratings made by 
the principals and the writer could have 
been determined by correlating two rat- 
ings made by the same individual at 
different times. It has been contended, 
however, that two ratings by the same 
individual are no better than one (18, 
P. 93). This fact coupled with the 
Writer's hesitancy to impose upow the 
good will of school officials participating 
in the study led to the rejection of such 
à procedure. 

(1) Use of split-half technique. 'The 
method used to determine the reliability 
of the rating scales was the split-half 
technique of correlating the odd items 
against the even items. Use of this meth- 
od with rating scales is not very satis- 
factory because of the small number of 
items, the heterogeneity of their con- 
tent, and the probable operation of the 
halo effect. The halo effect would tend to 
raise the coefficient of reliability, while 
at the same time lowering the validity of 
the scale. Regardless of its disadvantages, 
the split-half method was employed, and 
use was made of the Spearman-Brown 
formula to estimate the reliability of the 
whole scale from the calculated relia- 
bility of the half, 

(2) Reliability of the scale used by 
the principal. By use of the split-half tech- 
nique and the Spearman-Brown formula, 
the reliability coefficient of -87 was ob- 
tained for the principal's scale, Partly 


responsible for this result, undoubtedly. 
were the halo effect and the so-called 
"logical error"—the tendency for a judge 
to give similar ratings in traits ha 
seem logically related in his own think- 
ing. ° 
(8) Reliability of the scale used by 
the writer. A reliability coefficient of .9? 
for the scale used by the writer in class 
room observation was obtained by means 
of the splithalf method and the Spear- 
man-Brown formula. The high value 
probably is due in part to such influences 
as the halo effect and the logical error. 
(4) Reliability of the questionnai 
used by the pupils. The coefficient ©, 
reliability was determined for the pupils 
questionnaire, Му Teacher, by correlat- 
ing the odd items with the even ro 
from 200 papers. These 200 papers were 
chosen by selecting, at random, two 
papers from each of the 100 piles © 
questionnaires relating to cach of the 
100 teachers. The result was found to be 
94 after the Spearman-Brown formula 
was employed. It would seem that the 
halo effect and logical error would b. 
especially potent in their influence upon 
ratings made by elementary school m 
dren. Too, the failure of the split-hal 
method to take account of the fluctua 
tional changes in individuals from da 
to day results іп an estimate of T€ 
liability that is probably too high from 
the standpoint of practical testing. 
Inasmuch as from twenty to forty Р 
pils rated each teacher it was possible t° 
compute the reliability of the instrument 
When used by more than one rate! 
Twenty papers chosen at random from 
each of the 100 piles (one pile for each 
of the 109 teachers) were divided into 
two piles of ten papers each. After find- 
ing the mean score for cach pile of ten 
Papers, the results were used in correlat- 


[> 
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ing the mean of ten ratings with the 
mean of the other ten ratings for each 
of the 100 teachers. The correlation co- 
efficient was found to be .83. This value 
represents the reliability of the instru- 
ment for ten raters. Substituting 2.5 in 
the Spearman-Brown formula, the re- 
liability coefficient, .93, was obtained as 
the estimate of reliability of the instru- 
ment for twenty-five raters. The value for 
twenty-five raters was determined inas- 
much as the majority of teachers were 
rated by groups of twenty-five pupils 
each. 

It would seem that the value, .93, ob- 
tained for twenty-five raters, is of greater 
Significance in the practical use of the 
questionnaire than the value, .94, which 
Was the estimated reliability of the in- 
Strument as such when used by one rater. 
The instrument would ordinarily be 
Used by a class of pupils rather than by 
à single pupil. Furthermore, the value of 
the reliability coefficient determined on 
either ten or twenty-five raters would ap- 
Pear to be closer to the true reliability 
Sought when one considers the errors in 
rating previously discussed in connection 
with the split-half technique. 


4. Comparisons Among Ratings and 
Inventory Scores 


а. The distribution of Inventory 
Scores for the unselected group of teach- 
ers. It is of interest to consider the dis- 
tribution of Inventory scores for the 
unselected group of teachers in relation 
to the distribution of scores for the two 


TABLE то 


RESULTS OBTAINED IN THE COMPARISON OF 
INVENTORY SCORES FOR THE SUPERIOR, 
UNSELECTED, AND INFERIOR GROUPS 
or TEACHERS 


sn Superior  Unselected Inferior 
Statistic Group Group Group 
N 100 100 100 
Range 324—(—99) 254—(— 176) 213—(— 297) 
M 131.0 77-6 —32.0 
S.D. 102.91 103.49 112.46 
ом 10.19 10.35 11.18 
Mdn 130.0 85.5 —41.8 


original groups of teachers. Table 10 
indicates the range of scores, mean, 
standard deviation, and median for each 
of the three distributions. The data are 
based upon the Inventory scores ob- 
tained by the original scoring process. 
As.»ne would expect, both measures 
of central tendency (mean and median) 
for the unselected group lie between 
the central tendency values obtained for 
the superior and inferior groups. How- 
ever, the obtained mean (77.6) for the 
unselected group is somewhat larger 
than the average (49.5) of the means for 
the two extreme groups. The same holds 
true for the obtained median (85.5) as 
compared with the average (44.2) of the 
other two medians. These differences 
may be due to one or more of such fac- 
tors as (a) the limitation of the un- 
selected group to teachers of the fourth, 
fifth, and sixth grades (seventh- and 
eighth-grade teachers in original samples 
were lowest); (b) the possibility that, of 
the two original groups of teachers, the 
inferior group was farther below the 


TABLE 11 


3 с v OF INVENTORY SCORES MADE 
, NCE OF gı AND g: FOR THE DISTRIBUTION O! 
VALUE AND SIGNIFICAD BY THE UNSELECTED Group or TEACHERS 


acm Probability i Probability 
Group | g 1, of ty, : h of to, 
Üniselected ШЕТ | esé | 25 | =ч. | 1.076 | Эш 
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parametric mean of an unselected popu- 
lation than the superior group was above 
it (weak teachers possibly more easily 
identified by principals than strong 
teachers); (c) the possibility that the un- 
selected group was actually composed of 
better-than-average teachers; (d) chance 
errors. 

It should also be observed that the 
range of the scores for the unselected 
group is from 254 to —176, whereas the 
total range for the extreme groups was 
from 324 to —297. The mean of the 
scores obtained by the simplified scoring 


system is 88.4, and the range extends 
from 120 to 38. 


The distribution of Inventory scores for the 
unselected group was tested for normality as 
were the two original distributions, T7 e sta- 
tistics g, and g, were again employed. Table 11 
presents the results obtained. The negative 
value of g, indicates an excess of scores larger 
than the mean or a mean smaller than the 
median. The value of gı does not differ sig- 
nificantly from zero, however, and asymmetry 
is not indicated. Although the negative value of 
Es indicates a tendency toward a flat-topped dis- 
tribution curve, the value of g, is too small to 
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be significant. As with cach of the two pum 
distributions, there is little evidence oF к 
parture from normality in the distribution 
scores for the unselected group of teachers. 


b. Correlations between Inventory 
scores and rating scale scores. Validity d 
efficients were found by computing cor 
relations between scores on the revised 
Inventory and scores on each of the 
teacher rating devices. Table 12 indicates 
the coefficients obtained in warum 
the Inventory results (both original an 
simplified scoring) with results o[ tht 
ratings considered separately and in сол” 
bination. Results obtained in correlating 
the ratings with cach other are also pis 
sented. The single validating айап 
showing the highest correlation (48 i) 
with the Inventory is the writer's аво 
vation of the teacher's classroom be 
havior, whereas the criterion showing 
the lowest correlation (.434) is the eS 
cipals’ ratings. Between these values ! 
found the correlation (.452) of the ЈЕ 
ventory with the attitudes of the pupils. 
The fact that, of the three criteria, the 


TABLE 12 


CORRELATIONS OBTAINED BETWEEN INVENTORY SCORE 


RATINGS CONSID! 
OBTAINED BETWE 


" ND 
S (ORIGINAL AND SIMPLIFIED SCORING) ^ 


PARATELY AND IN COMBINATION, TOGETHER WITH CORRELATIONS 
N THE RATINGS, FOR THE UNSELECT 


Ер GROUP oF тоо TEACHERS 


Inventory and Principals' Ratings 
Inventory and Classroom Observ: 
Inventory and Pupils' Attitudes 


Inventory and Three Validating Criteria Combined 


Inventory and Combination of Classroom Observa- 
tions and Principals' Ratings... 
Inventory and Combination of CI 


assroom Observa- 


Classroom Observations and Principals’ Ratings... 
Classroom Observations and Pupils' Attitudes. 
Principals’ Ratings and Pupils’ Attitudes 


— — 
Original Scoring Simplified Sdoring 
#* or r* @ uem 
8r 
-434 -082 445 Je 
486 677 1488 077 
š .452 o8o 456 .o8o 
+594 +065 -596 -065 
3595“ .065 
-| 2536 072 -544 ә 
+576 067 .579 -067 
E +534 072 .543 | ee 
-| 2480 :077 
-326 +090 
A -389 l :085 | 


* These obtained 7's are all statistically 
** Multiple correlation coefficient. 


significant at the 1 per cent level ( 


8, p. 324). 


А SCALE FOR TEACHER-PUPIL ATTITUDES AND RAPPORT 


principals' ratings show the lowest cor- 
relation is rather interesting, inasmuch 
as the judgments of principals formed 
the criterion on which the selection of 
the original standardization groups was 
made. 

It was believed that the three validat- 
ing criteria combined, or two of them in 
the several possible combinations, would 
prove of greater value in establishing the 
validity of the Inventory than any one 
taken alone. It seemed reasonable to sup- 
pose that the combined judgment of 
several persons or groups relative to a 
teacher’s behavior is of greater worth 
than the judgment of any one single 
agency, To make a combination of meas- 
ures possible, all raw scores were con- 
verted into standard T-scores. 

The correlation between the Inventory 
scores and the three validating criteria 
combined was found to be .594 (-596 
with simplified scoring). When several 
criteria are thus considered together, ex- 
tremes in one or two of them are counter- 
balanced by more moderate ratings in 
those remaining. A “tempering” or 
“leveling” process takes place which, 
apparently, tends to bring the resulting 
score into closer relationship with the 
score on the Inventory. 

A multiple correlation of .595 between 
the Inventory scores and the three cri- 
teria was also obtained. 

Of the several combinations of two 
validating criteria, that of the classroom 
Observations and pupils’ attitudes cor- 
relates highest (.576) with the Inven- 
tory. When compared with each other, 
however, these two criteria show the 
least agreement (r equals .326). In 
Closest agreement are the classroom ob- 
Servations and the principals" ratings 
(r equals .480). The fact that the ratings 
of the principals and those of the writer 
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agree more closely with each other than 
either of these agrees with the reactions 
of the pupils is of interest. Such findings 
suggest possible differences in standards 
of evaluation by which adults and chil- 
dren judge the behavior of teachers. 
Support is also given the view that pupil 
rating is an important and unique con- 
tribution to an adequate evaluation of 
teacher behavior. 

Table 12 also presents the correlation 
coefficients computed when using the 
Inventory scores obtained by use of the 
simplified scoring procedure. It is inter- 
esting to note that, in each case except 
one (comparison between Inventory and 
classroom observations), these correla- 
tions are slightly higher than those ob- 
tained when using the original Inventory 
scores, These slight differences, though 
quite consistent, are inconsequential, 
however, in comparison with the close 
agreement between the two sets of values. 


5. Ап Additional Check on the Validity 
of the Items 


An additional check was made on the 
validity of the items in the revised in- 
strument. Out of the distribution of the 
100 teachers in the unselected group the 
item response patterns of the twenty 
with the highest scores and the response 
patterns of the twenty with the lowest 
scores were compared, respectively, with 
the patterns of item response characteriz- 
ing the original groups of superior and 
inferior teachers. Chi-square was the 
statistic used as the measure of com- 
parison. 

It seemed reasonable to suppose that, 
if a particular item were valid, it would 
call forth similar patterns of response 
from the two superior groups of teachers 
and similar patterns from the two in- 
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ferior groups. Of the 328 (2 x 164) com- 
parisons, 270, or 82 per cent of them had 
chi-square values less than 9.488, a value 
which is at the 5 per cent level of sig- 
nificance. A chisquare value less than 
9.488, therefore, indicated a high prob- 
ability that differences found between 
the two groups, in response to a particu- 
lar item, could be chance differences. Al- 


TABLE 13 


DISTRIBUTION OF RESPONSES OF THE SUPERIOR AND INFERIOR GROUPS or TEACHERS AND OF 
THE HIGHEST 20 AND LOWEST 20 TEACHERS IN 
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agreement or disagreement but reveal discrepan- 
cies in the degree of reaction. The group of 
twenty teachers, at each extreme, generally 
tended to exhibit a more intensive attitude rela- 
tive to the opinion statements. 


B. DETERMINING THE RELIABILITY OF 
THE INVENTORY 


The splithalf method was used in de- 
termining the reliability of the Teacher 


THE UNSELECTED Group TO ITEM 11 


тт. Pupils are naturally lazy when it comes to doing school work. 


Strongly А Б Strongly 
Agree Agree Uncertain Disagree Disagree 
Group I 2 3 4 5 
Superior o 7 11 73 9 (N= 100) 
Inferior 3 17; 14 63 3 (N= 100) 
Highest 20 1 2 o 9 8 (N=20) 
Lowest 20 1 5 2 10 2 (N=20) 


though a higher proportion than 82 
per cent would have been desirable, 
this result does lend further support to 
the validity of the items in the Inven- 


tory. 


It is of interest to analyzse further, by in- 
spection, the response patterns of an item that 
scems to be representative of those cases where 
the chi-square value was greater than 9.488. The 
response patterns for Item 11 are indicated in 
Table 13. The chi-square value of 20.618 was 
found, for Item 11, ОП comparing the response 
patterns of the 100 superior teachers with those 
of the highest twenty in the unselected group. 
Contributing to the high value of chi-square are 
the differences between the two groups relative 
to responses "4" and "5." The great majority 
of both groups of superior teachers disagree with 
the statement. Yet, a much larger proportion 
(qo per cent) of the twenty teachers "strongly 
disagree" than of the 100 teachers, where only 
9 per cent "strongly disagree." This relatively 
high proportion (40 per cent) undoubtedly ac- 
counts for the small representation found under 
responses "3" and "4" of the group of twenty 
teachers. As is true with the other items show- 
ing high chi-square values, the response patterns 
indicate validity of the item relative to general 


Pupil Inventory. Correlating the score 


on the odd-numbered items with those 
on the even-numbered items, à correla- 
tion coefficient of .833 was obtained. Use 
of the Spearman-Brown prophecy 100 
mula resulted іп а reliability coefficient 
of .gog for the entire instrument. 

The reliability of the Inventory pes 
also determined, by means of 
half method, on the basis of t 
obtained by the simplified scoring 
cedure. Under these conditions the odd- 
even correlation was .794. Determined by 
use of the Spearman-Brown formula, the 
reliability coefficient for the entire 1 
strument was found to be .885. The 
standard error of measurement, dete" 
mined on the basis of the simplified sco 
ing system, was 6.25. May it be men 
tioned again at this point that the sin" 
plified scoring procedure yielded results 
which correlated .973 with the origin? ЖА 
scores. 


he scores 
pro i 


IV. SUMMARY AND CONCLUSIONS 


1. The major purpose of this investi- 
gation was to construct a measuring in- 
strument which would gauge the atti- 
tudes of teachers toward pupils and serve 
to differentiate those teachers who get 
along well with children from those who 
do,not. The instrument, known as the 
T'eacher-Pupil Inventory, in its revised 
form, consists of 164 statements of 
opinion relating to pupils and children 
and the part the teacher should play in 
their educational development. Admin- 
istration time is from twenty to thirty 
Minutes. 

For purposes of standardization and 
Scoring, the original instrument, consist- 
ing of 756 items (2 forms with 378 items 
each), was administered to 100 teachers 
in each of two groups, one at each ex- 
treme of the teacher-pupil relationship 
Continuum, and comparison then made 
of the item responses. The statistic, chi- 
Square, served as the objective measure 
of comparison. Those items not differ- 
€ntiating adequately were rejected. A sys- 
tem of scoring was developed on the 
basis of the responses of the superior 
and inferior groups of teachers. 

2. Significant aspects of the present 
investigation included the use of empiri- 
cal procedures of test construction and 
the use of pupil reaction as а basic vali- 
dating criterion. The validity of individ- 
ual items in the instrument was estab- 
lished by the use of the chi-square 
technique in the comparison of the re- 
Sponse patterns provided by the two 
Sroups of teachers at the extremes of 


and the revised form 
tory may be found in 
respectively, in the 
(10) on file in the 
Plans are being 
in the near fu- 


of Ве two original forms 
he Teacher-Pupil Inven 

Ppendices E, F, and G, 
Writer's Ph.D, dissertation 
niversity of Minnesota Library. 


made to š 
nventor| 
ture, publish the I y 


23 


teacher-pupil relationship. The validity 
of the Inventory as a whole was deter- 
mined by administering it to a new 
sample of 100 teachers from the fourth- 
fifth-, and sixth-grade levels, unselected 
as to teacher-pupil relationship, on 
whom ratings were then obtained from 
each of three sources: (а) the principal, 
(b) the writer (from classroom observa- 
tion), and (c) the pupils. 

When correlated with Inventory 
scores, the ratings of the principal, the 
writer, and the pupils gave validity co- 
efficients of .434, .486, and .452, respec- 
tively. After combining the three ratings, 
the combined result correlated .594 (.596 
with simplified scoring) with the In- 
ventory scores. A multiple correlation of 
.595 between the Inventory and the three 
criteria was also obtained. These cor- 
relations were all significant at the 1 per 
cent level. 

The means of the Inventory scores of 
the superior, unselected, and inferior 
groups of teachers were, respectively, 
131.0, 77.6, and —32.o. The difference 
between the means of the superior and 
inferior groups yielded a critical ratio 
of 10.8. By statistical test (use of g, and 
g), none of the three distributions of 
Inventory scores was found to depart sig- 
nificantly from normality. 

3. The reliability of the Teacher-Pupil 
Inventory was found to be .gog as deter- 
mined by means of the split-half method 
and the Spearman-Brown prophecy 
formula. On the basis of scores obtained 
by the simplified scoring system, the re- 
liability coefficient was .885. 


This investigation has shown that 
teachers’ attitudes toward pupils and 
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their behavior are related to teacher- 
pupil rapport in the classroom. It has 
shown further that these attitudes can be 
measured with as high a validity (.595) 
as can academic aptitude. 

These findings are of significance in 
teacher training, selection, and evalua- 
tion. It is true that the validity of the 
Inventory was determined under favora- 
ble conditions of test administration. 
Further research? is needed to demon- 

? Research is under way which should lead to a 


more refined instrument and a greater knowl- 
edge of its measurement possibilities. 


strate the validity of the instrument for 
purposes of selection and evaluation of 
teachers under actual "operating condi- 
tions" of administration. 

The day may not be far distant when 
teacher-training institutions will be able 
to take account of a teacher's personal 
effectiveness in the classroom in as ob- 
jective a manner as is donc at present 
with teaching techniques and knowledge 
of subject matter. It is hoped that such 
an instrument as the Teacher-Pupil In- 
ventory will contribute, in a small but 
significant way, to this end. 
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I STATEMENT OF THE PROBLEM* 


HE LITERATURE on reasoning reveals 
IS. variety of methods used or usable 
by subjects during problem-solving. Al- 
though some of the methods are reported 
as having been experimentally observed 
in psychological studies, other methods 
appear to be personal reflections on how 
problems could be solved. Whereas the 
former could actually be either effica- 
cious, indifferent, or detrimental to 
successful solution, the latter are im- 
plicitly presumed to be advantageous. 
Nine of these “methods of attack” have 
been selected for investigation in this 
experiment, in an attempt to determine 
their use and efficacy for a variety of 
problems. 

There are three questions with which 
we shall deal: 

1. How do individuals differ in the 
methods they use to solve problems? 

2. To what extent are these methods 
used in solving problems? , 

3. To what extent are different meth- 
ods efficacious? 

The nine methods selected for investi- 
gation are described below. 

1. Clear formulation of the problem. 
Dewey early emphasized the importance 
of awareness of the specific problem. In 
his first edition of How We Think, he 
discussed as the first 2 of his 5 steps in 
reflective thinking: (а) a felt difficulty, 
and (b) its location and definition. 

2. Preliminary survey of all aspects of 
the presented material. The Gestalt 
School has stressed the role of survey of 
the whole field—the situation as a whole. 

* In accordance with new editorial policy, the 
review of the literature on reasoning, forming 
the background of this inquiry, is omitted in 
this monograph, but is a part of the original 
dissertation, available on interlibrary loan from 


the Northwestern University Library, Evanston, 
Illinois, 


And Durkin (6) found that solution was 
facilitated by initial familiarity with all 
the parts of her construction-puzzles. 

3. Analysis into major variables. Sea- 
shore (17, р. 554) writes, "In any large 
and difficult problem, one of the first 
steps is to analyze out its main variables, 
which may then be attacked one at a 
time, in simpler fashion.” Here the 
problem-solver might ask himself this 
question: "What are the principal fac- 
tors which determine or influence the 
solution of this problem?" It is recog- 
nized that the problem-solver may utilize 
variables not helpful for solution. 

4. Locating a crucial aspect of the 
problem: "Loci" (Ruger) "Direction" 
(Maier) "Analysis" (Heidbreder). The 
subject might question: “What part of 
the problem seems to merit special atten- 
tion as the central factor relative to solu- 
tion?" Ruger found his subjects becom- 
ing aware of the "locus" of difficulty. 
Maier demonstrated the importance of 
the “way the problem is attacked." How- 
ever, Maier also warns against "habitual 
behavior" or "persistence" as false direc- 
lion, wrong attack, a side-track away 
from correct direction leading to solu- 
tion. And Heidbreder suggested that the 
function of analysis is to reduce the sup- 
ply of stimulus-content by singling out 
possibly-significant features. As with 
major variables, the problem-solver may 
pursue aspects "crucial" to him, but 
actually not helpful to solution. 

5. Application of past experience. 
The significance of applying past ex- 
perience ("transfer") is a commonplace 
(Barlow, Chant, Kinnamon, Lindworsky, 
Lyon) Drever (5) concluded that the 
application is not necessarily conscious. 

6. Varied trials. By definition, repeti- 
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tion of the same kind of trial could not 
be successful unless it was correct the first 
time (also note the previously mentioned 
warning of Maier against persistence in 
an unsuccessful direction). Variability 
has been established as one of the prime 
factors conducive to solution (Bain, 
Durkin, Lloyd Morgan, Thorndike, J. C. 
Peterson, et al.). 

7. Control. In experimentation, con- 
trol (variation, isolation, repetition) is 
foundational. In solving a problem, the 
question can be put: "How can I control 
each factor involved so as to determine 
its effect and relative influence?" 

8. Elimination of sources of error. The 
literature reveals scarce mention of the 
role of this factor. What sources of error 
(for instance, unsteadiness in a delicate 
mechanical puzzle) may be eliminated or 
controlled so as to facilitate solution of 
the problem? As with other methods (sec 
$ and 4, above), the problem-solver may 
eliminate factors which are not actually 
sources of error. 

9. Visualization. Chen, Comstock, 
Eidens, Pear, Storring, Wilhelm, and 
many others have demonstrated the role 
of imagery in reflective thinking. In be- 
havioristic terms imagery may be con- 
sidered as the use of implicit trials. 


Even at the outset, it appeared that 
some overlapping exists among certain 
of these methods, for example “analysis 
into major variables” and “locating a 
crucial aspect of the problem.” 

The subjects were 25 students en- 
rolled in an advanced undergraduate 
course in psychology. It was recognized 
that any conclusions concerning efficacy, 
and especially extent of use, of specific 
methods would be based on а highly 
select group of subjects, and that the ex- 
periment should be considered only as 
an exploratory study opening the way to 


systematic study of problem-solving in 


the general population. 

Three problems were chosen, not as 
representative of typical human prob- 
lems, but because their simplicity would 
permit some objective analysis relatively 
uncomplicated by the non-intellectual 
(e.g., emotional) factors often affecting 
daily problem-solving. The three prob- 
lems, involving induction, deduction, 
and geometrical-figure analysis, are de- 
scribed in the next section. 

Our problem, then, was to determine 
the use and efficacy of nine known 
“methods of attack” on different prob- 
lems. 
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II. THE PROBLEM-MATERIALS 


А. Tue INDUCTION PROBLEM 
1. Construction 


HE INDUCTIVE PROBLEM, largely concept-formation (devised by Professor Thur- 
Tua is presented in [ull, below. Five groups of letters are presented, with four 
groups always containing something in common; the subject is to detect the group 
not containing the feature common to the other four groups. Exclusive of the prac- 
tice-problems, 19 such problems were given. 

LETTER-GROUPING 


In the five groups of letters below, notice that four of the groups have something in common. 
The one which is not like the other has been marked. 


YLNPJ XBPZR YNTFH 


YRGSW YHDLY 


Four of the groups begin with the letter Y. The one which does not begin with Y is marked. 
Look at the next problem. 


AAXVP FNTBB HQTQX JDSWR MECTE 


Four of the groups have a repeated letter. The first, second, third, and fifth groups have the letters 
AA, BB, QQ, and EE respectively, The fourth group does not have a repeated letter. It is unlike 
the other groups so it is marked. 

Work the next two problems. Put a mark under the group which is different in each case. 
ABCDE XYZPS EFGHI CXVTN PJKLM 
EDCBA VWXYZ IHGFE ¢ JKLMN SBTGK 

In the first problem four of the groups have letters occurring in alphabetical order. You should 

have put a mark under the fourth group. 

In the second problem four of the groups are in alphabetical order, left to right or right to left, 

The fifth group should be marked. 
Look at the next two problems, Try to find which one should be marked. 


BDHOX GXMOT LEZTQ NXUFS 
KCUPE UMBXA APIQT LNUKH 


In the first problem four of the groups each contain a vowel. The first group should be marked. 
In the second problem four of the groups have two vowels. Group five should have been marked. 
Now work the next two problems. In each case put a mark under the group which is different. 


JPIDX KPHMA LPZSU MPLJE NPRWO 
PZĎDD SSSPB JPPPN HNNNX ВОВЕВ 


In the first problem four of the groups have the vowel in the last place. The first group should 


be marked. В А ; 
In each group of the second problem there is a letter occurring three times, In four of the groups 


the three identical letters are together. The last group should be marked. 


Work the problems which follow. 


1. VBCDF WQRSH XIJKF YLMNF ZABCF 
2. ELMNE FABCF GOPOG HZTMH IVWXI 
3. LGBGM TBKAK ZDNEN FRLLC VPSPW 
4. XVTQZ BFDHJ HDFBJ JBFDH HJFBD 
5. BCAHV BCOHV DZELM DKILM DTCLM 
6. FCRCT CJCCL CPCSC CCRCQ CKCFC 
7. BWBRB BBCBS RBMBB BNBBV XBBZB 
8. BQHTM AUIEO ZDKRN TMPZB HXKQN 
9. EFGHI EDCBA UTSRQ ALRXI OPORS 
10. CEDGH FAGHB DKCES LMCED MPCEK 
11. CFAGG PESRP VROVF NOMSN PROTS 
12. ONMLK LKJIH UTSRQ GKMOQ HGFED 
| 13. ALMNA PBCDP EVRHE UJKLU IDVLI 
14. CCPPR MSSMR TRRTR JNNJR WWDDR 
| 15. BSQVM TDPDX GHGRG FPSFR MPMBM 
10 16. АСТАМ ABSPQ CDTGH PQBXY ас 
IEG 17. JKLMV BPQRS KNWXY BODQT CXTRF 
| 18. FGHLM CDFGH JKMNO STVWX POSTU 
19. VMSAT DGMSB SPLOM QTMSC MSDHP 


°° 
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2. Administration 


This group of problems was adminis- 
tered individually to 20 of the 25 sub- 
jects (5 students reported familiarity 
with this induction-test; their results 
were excluded). As per directions on the 
form, the subject first worked all of the 
practice-problems. Starting with the last 
set of (five) practice-problems, the subject 
was questioned, as soon as he finished 
each one, as to why he had selected the 
one group of letters which he had 
marked. His answer was recorded as 
either “correct,” or in full if he had 
marked the right group but gave an in- 
correct reason or if he had marked the 
wrong group and gave some reason for so 
doing. After he was finished, the follow- 
ing four questions were asked so a to 
detect evidence of use of methods of 


attack: 
1. What are you supposed to do in 


these problems? (To detect "Clear formu- 
lation of the problem.") 

2. Try to list all the ways you can re- 
member in which four groups had some- 
thing in common lacking in the fifth 
group. (To detect "Analysis into major 
variables.") 

3. Have you ever had any previous ex- 
perience with a similar kind of problem? 
(If yes:) Were you able to apply this past 
experience here? How? (To detect “Ар- 
plication of past experience.") 

4. Do you remember anything else con- 
cerning the procedure you used which 
you believe was helpful in solving these 
problems? (To detect any other methods 
of attack.) 

Since the three specific questions (ex- 
clusive of the last general question) 
tested for only three methods, we relied 
upon careful examination of the sub- 
ject’s worksheet to detect evidence of 
other methods used. It was also recog- 


nized that cach question was only опе 
way of testing for use of any particular 
method of attack, that subjects unable to 
state use of a method may use the meth- 
od, perhaps unconsciously, and that 
other questions and ways of detecting a 
specific method could be devised; these 
limitations are discussed below, under 
“Discussion” and "Summary and Con- 
clusions." As with the other problems ad- 
ministered, the subject was allowed as 
much time as he wished for these induc- 
tion-problems. 


B. Tue DEDUCTION PROBLEM 
1. Construction 


'The deductive problem (selected as 
one of i5 items in one of Professor 
Thurstone’s Deductive Reasoning Tests) 
was presented in printed form, as fol- 
lows: 


(a) Captain Watts and his son James have been 
found shot—the father in the chest and the son 
in the back. (b) Both clearly died instantancously. 
(c) А gun fired close to the person—as, for 
example, when a тап shoots himscelf—will 
blacken and even burn the skin or clothes; fired 
from a greater distance it will leave no such 
mark. (d) The two bodies were found near the 
middle of a large hall used as a rifle range. (e) 
Its floor is covered with damp sand which shows 
every footprint distinctly. (f) Inside the room 
there are two pairs of footprints only, (g) A third 
man standing just outside the door or window 
could aim at any part of the room, but the pave 
mene outside would show no footmarks. (/) 

nder Captain Watts' body was found a gun: 
нр weapon was found near James. (i) In cach 
Ыш =. diee the bullet entered, was 
singed "s 1 Eonpowdes and the cloth a little 
and = аре W atts was devoted to his son 
ould have died sooner than harm him 

Deg neg hence it is impossible to suppose that 
a р him deliberately, even in self-defense. 
ut some think that James secretly disliked 


on and hoped to inherit his fortune at his 


a. To what was Captai: 
dass 1 s tain Watts’ death due? 
Murder? Accident? Suicide? 


b. To what was J. 
a ames’ death o? [jp AC 
cident? Suicide? ї due? Murder? 
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2. Administration 

This problem was administered indi- 
vidually to all 25 students. "Then the fol- 
lowing questions were asked: 

1. After reading the problem-questions 
at the end of the story, did you re-read 
the entire paragraph, keeping in mind 
constantly the three possibilities of 
murder, accident, suicide? (Since it early 
became evident that all subjects under- 
stood the problem set to them here, we 
used this question as probing for the 
possibility of extreme fixation of the 
answer-possibilities, or extremely vivid 
"Formulation of the problem.") 

2. How many times did you read the 
whole paragraph? (To detect careful 
"Preliminary survey of all aspects of the 
presented material.") 

з. Did you conclude that certain bits 
of given information were important or 
relevant, either directly or indirectly, for 
solving this problem? If so, consult the 
paragraph, and list all such points. (To 
detect “Locating a crucial aspect of the 
problem.") 

4. Have you ever had any previous 
experience with a similar kind of prob- 
lem? If so, were you able to apply this 
past experience here? How? (To detect 
"Application of past expcrience.") 

5. Did you try to apply the possibility 
of each of the three causes—murder, acci- 
dent, suicide—to the death of Captain 
Watts—and then to the death of James— 
to determine whether the cause could 
have been murder, could have been acci- 
dent, could have been suicide? (To de- 
tect “Varied trials.") 

6. If you re-read the paragraph, did 
you single out each sentence to try to 
determine whether any of the various 
sentences made impossible one or two of 
the three possible causes? (To detect 
"Control.") 
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7. Consult the paragraph for the fol- 
lowing questions: 

a. Was there any information in the 
paragraph which makes the following 
sentence not helpful in solving the 
deaths: "A third man standing just out- 
side the door or window could aim at 
any part of the room but the pavement 
outside would show no footmarks." 

b. Was there any information which 
indicated whether Captain Watts or 
James died first? 

c. Was there any information which 
made murder improbable for James’ 
death? 

d. Was there any information which 
made suicide improbable for James’ 
death? (To detect “Elimination of 
soutces of error.") 

8. Did you find yourself visualizing the 
various persons and events? (To detect 
“Visualization.”) 

9. Do you remember anything clse 
concerning the procedure you used? (To 
detect any other methods of attack.) 


C. Tur GEOMETRICAL-FiGURE. PROBLEM 


1. Construction 


The geometricalfigure problem (sug- 
gested by Lindley's (10) study) is pre- 
sented in Figure 1. As with many other 
similar figure-problems, this figure is to 
be drawn without going over any line 
already drawn and without taking the 
pencil off the paper. Starting at any 
point except either of the two inner 
diagonal points makes solution im- 
possible. Fourteen trials were possible on 
the sheet. 


2. Administration 


This problem was administered indi- 
vidually to 24 of the 25 students (one 
subject accidentally started at the correct 
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starting-point; his paper was excluded). 
The subject was allowed 14 trials. If he 
solved the puzzle on or before the four- 
teenth trial, he was asked to continue, 
and was stopped after making 3 successes 
in a row. The experimenter kept a rec- 
ord of the starting and finishing points 
for each trial. Then the following ques- 
tions were asked: 

1. Did you assume at the outset that 
the directions required that you start at 
some specific corner? (Since pre-experi- 
mental investigation had revealed that 
many subjects started at the upper left- 
hand corner, we included this question 
to detect any invalid "Formulation of the 
problem.") 

з. Before beginning the puzzle, did 
you make a preliminary survey ог, all 
aspects of the pictured puzzle, or did you 
start drawing after a glance at the 
puzzle? (To detect "Preliminary survey 
of all aspects of presented material.") 

3. Did you conclude at any time that 
some special part of the puzzle was best 


to start at? I so, what part? (To detect 
“Locating a crucial aspect of the prob- 
lem.") 

4. Have you ever had any previous ex- 
perience with a similar kind of problem? 
If so, were you able to apply this past 
experience here? How? (To detect “Ap- 
plication of past experience.") 

5. Did you discover certain moves 
which made solution impossible, or 
moves which you decided to avoid? If so, 
what moves or combination of moves? 
(Io detect "Elimination of sources of 
error.") 

6. Did you ever visualize the various 
possible next-moves before even tracing 
out these moves on the paper? If so, 
about how many moves ahead did you 
visualize on the average? (To detect 
“Visualization.”’) 

7. Do you remember anything else 
concerning the procedure you used 
which you believe was helpful in solving 
this puzzle? (To detect any other meth- 
ods used.) 


y 


III. RESULTS 


ECAUSE it would break continuity to 

list the findings on extent of use for 
each of three problems and then have to 
return to the problems for data on 
efficacy, we present here the complete 
results for each of the three problems. In 
the "Discussion" of this paper, we shall 
return to our two main questions: use 
and efficacy of 
attack, 


various methods of 
A. Tue INDUCTION PROBLEM 


Table 1 shows the distribution of 
scores. The range is small probably be- 


TABLE т 


No. of 
Subjects 


Mean 
Stand. Dev. 


cause the items, selected from a speed- - 


test, were taken in unlimited time and 
were solved, in most cases, by almost all 
the subjects. 


1. The Methods of Attack Used 


As contrasted with the other two prob- 
lems to be discussed, the induction prob- 
lem revealed only three distinct methods 
of attack. 

(a) Clear formulation of the problem. 
The initial instructions read: "In the 
five groups of letters below, notice that 
four of the groups have something in 
common. The one which is not like the 
others has been marked. 

YLNPJ XBPZR 


YNTFH 
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Four of the groups begin with the letter 
Y. The one which does not begin with 
Y is marked.” 

The other examples given (8 alto- 
gether) were similar. For each problem, 
four of the five groups have something 
in common lacking in the remaining 
group. It must be noted that the fifth 
group must be not merely different, but 
different in the sense of lacking a char- 
acteristic common to the other groups; 
each group was probably different from 
the other four in the sense of containing, 
say, some letter absent from the other 
four groups, but this kind of difference 
was not the required kind. The failure 
to,make the distinction between merely 
being different and being different in the 
sense of lacking a characteristic common 
to four groups became evident in the re- 
sults, A majority of the subjects formu- 
lated the problem simply as: Find the 
group which is different. We have con- 
sidered this as an "incomplete" formula- 
tion. 

Only 40% of the subjects correctly 
formulated the problem, verbally. How- 
ever, incomplete verbal formulation did 
not seem to involve actual incomplete 
formulation during the working of the 
problems, as can be observed by the fact 
that even the lowest score was: 14 correct 
out of a possible 19, or 74% correct 
(chance, for 5 choices, would permit 20% 
correct); it seems axiomatic that success- 
ful solution of this kind of problem 
would require understanding the task. 
Apparently the nature of the task was 
evident in spite of the incomplete verbal 
formulation. This difficulty of mistaking 
inability to state a method for failure to 
detect or use a method was anticipated 
YRGSW YHDLY 
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(see Administration of the Induction 
Problem, above), and continued to ob- 
scure the use of methods in all three 
problems, as further described in this 
paper. Incomplete formulation led, how- 
ever (in a few cases), to the kind of error 
which might be expected: selecting a 
group which was merely different. The 
three examples found are given below: 


SUBJECT RLD: BCAHV BCOHV 
The subject's reason: “The others do not have 
a letter Z." Note that to carry out his incorrect 
formulation of the problem at this point the 
subject would be obliged to mark the first group 
as alone containing an A, the second group as 
alone containing an O, the fourth group as alone 
containing a K, and the last group as alone 
containing a Т. Selecting the third group be- 
cause it alone contained the letter Z indicates 
incomplete (if not incorrect) formulation. This 
subject's verbal formulation was: “Pick the опе 
group that is different from the other four 


groups." 


SUBJECT MN: MSSMR 


CCPPR 

Although the group marked is the correct one, 
the subject's reason is wrong, close however to 
borderline correctness: “This group has the same 
letter three times.” The correct reason is: The 
other four groups each have one and only one 
letter R, or (another reason): The other four 
groups each have three different letters. This 
subject's verbal formulation was: “To find the 
group which is different.” 


SUBJECT MEM: BSQVM TDPDX 

The subject's reason: “The other four groups 
do not have any letter repeated three times.” 
As a matter of fact her statement is false, for 
the third group has the letter G three times. 
But even if her statement were true, her reason 
indicates the same reversal of the problem-task: 
she finds a characteristic absent from four groups 
when the instructions require her to find a 
characteristic present in four groups. Her verbal 
formulation was: “Underline the one that is 
different in any respect from the other four 
groups.” 


(b) Preliminary survey of all aspects 
of the presented material. The particular 
nature of this problem required every 
subject to survey all 5 groups of letters; 
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this factor was found in 100% of the 
subjects. 

(c) Analysis into major variables. Our 
question asked the subject to list all the 
ways in which four groups had some- 
thing in common lacking in the fifth 
group. Analysis of all the problems (8 
practice-problems and 19 test-problems) 


DKILM 


DZELM DTCLM 


gave the following list as including every 
possible reason for the 27 problems that 
each subject encountered: 


Four groups have some common letter (or 2, 
3, 4, Or 5 letters). 

Four groups have only one of a letter common 
to all five groups. 

Four groups have 2 given letters adjacent. 

Four groups have some common letter in a 
constant position (say, fourth in the group 
of letters). 


TRRTR 


JNNJR WWDDR 
Four groups have some vowel (or 2 vowels). 
Four groups have some vowel in a constant 

position. 

Four groups have five consonants. 
Four groups have three different letters. 


Four groups have a repeated letter (or a 
letter three times). 
GHGRG FPSFR MPMBM 


Four groups have a repeated letter adjacent 
(or separated). 


Four groups have a repeated letter separated 
by one letter, 


Four groups have an alphabetical sequence 
(of 9 1, Or 5 letters). 
Four groups have 2 alphabetical sequences of 


2 and 3 letters, respectively. 


Analysis of these thirteen ways in 
which four groups had something in 
common further revealed 8 major vari- 
ables basic to all such problems: 

Letters. 

Vowels. 


Consonants. 


METHODS OF ATTACK ON REASONING PROBLEMS 9 


Position of a letter. 

Alphabetical sequence. 

Adjacency or separation of 2 given let- 
ters. 

Quantity of a given letter. 

Quantity of different letters within a 
group. 

We credited a subject for each such 
variable stated or indicated in his list of 
"ways in which four groups had some- 
thing in common." 

Each of the 8 variables was acknowl- 
edged by one or more subjects. Some 
subjects were more thorough in formu- 
lating a variable; for example, some re- 
membered and stated that alphabetical 
sequence could be either forward or 
backward. 

The distribution of the number of 
major variables reported is shown in 
Table 2. 

Although other methods of attack may 
have been employed by the subjects, no 
other methods revealed themselves as 
having been used on this problem. Ex- 
perience with this problem leads us to 
suggest the following reasons for the fact 
that only three methods appeared for 
this problem: 

1. Locating a crucial aspect of the 
problem: Here, it seems (to the writer) 
that this particular problem does not 
contain a "crucial aspect" (further ex- 
plained in the “Discussion”). 

2. Application of past experience: 


TABLE 2 
Major Variables Nuniber of 
Reported Subjects 
Pose: ds tate sesar pa MAP ВЕ PATE Me Bese E 4 


.6 variables 
-97 


Mean 
Stand. Dev. 


The 20 subjects had reported, before the 
experiment, that they had not previously 
worked problems of this kind. 

3. Varied trials: Variation could mean 
several things in this problem. (a) If it 
be considered as referring to the subject's 
questioning each group's lack of a char- 
acteristic common to the other four 
groups, then the small amount of stimu- 
lus-material probably permitted every 
subject to try all five groups, time being 
unlimited. (b) If variation be considered 
as referring to trying various generaliza- 
tions (of characteristics possibly com- 
mon to four of the five groups), then 
this “method” becomes synonymous with 
"Analysis into major variables," re- 
ported, above, for this problem. 

4,, Control: Same as for “Varied trials." 

5. Elimination of sources of error: As 
with (1) above, the writer doubts that 
this factor is adequately meaningful for 
this problem. 

6. Visualization: The stimulus-material 
itself is visual in nature. 

The full implications of the apparent 
absence of certain methods on certain 
problems will be more completely treated 
in the‘‘Discussion.” 

The range of the total number of 
methods used per subject was from two 
to three, with the mean at 2.4 methods 
per subject. 

A question of special interest was: 
Why did our subjects make the various 
errors we found on their work-records? 
We have analyzed the 19 errors made, in 
relation to the reasons the subjects gave 
in each case for marking the group of 
letters they selected: 


Errors 
Perceptual (visual) errors: 12 (63%) 
Wrongly formulated problem-task: 3 (16%) 
Miscellaneous (hunches, etc): 4 (21%) 
Total: 19 
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63% of the errors were perceptual mistakes. We list hete some examples: 


VBCDF WQRSH 


XIJKF 


YLMNF ZABCF 


SUBJECT PHK: "Because the other groups have the three middle letters in alphabetical se- 


quence." 
But so does the one she marked. 


LGBGM TBKAK 


ZDNEN 


FRLIC VPSPW 


SUBJECT KLJ: “Because the other four groups have one letter repeated.” 


But so does the one she marked. 
FGHLM CDFGH 

SUBJECT JAH: "Because this group is the 
only one with a vowel." 

But note the last group. 

16% (3 errors were the result of 
wrongly formulated problem-task and 
have been already listed and discussed 
above under "Clear formulation of the 
problem." 


2. The Efficacy of the Methods Used 

Following are the correlations between 
the methods used and the number of 
problems worked correctly. 

(a) Clear formulation of the problem. 
Clear verbal formulation correlated (bi- 
serial correlation) —.10 with problems 
worked correctly, so individual differ- 
ences in verbal formulation in this ex- 
periment do not seem to be related to 
successful solution. Apparently subjects 
understood the task in spite of inability 
to state the task clearly and correctly. 

(b) Preliminary survey of all aspects 
of presented material. All subjects used 
this factor, so no comparative evidence 
was available concerning its efficacy. 

(c) Analysis into major variables. The 
number of major variables reported 
correlated (product-moment correlation) 
+.51 (Р.Е. = .11) with problems worked 
correctly, so detection of major variables 
seems to be moderately correlated (posi- 
tively) with successful solution. 


В. Tur DEDUCTION PROBLEM 


Following is the distribution of scores 
on the problem concerning the deaths of 


JKMNO 


STVWX POSTU 


Captain Watts and his son James. 


Students 
Correct cause for both deaths: 15 (60%) 
Correct cause for one death: 10 (40%) 
Correct cause for neither death: o 


Given unlimited time, the majority, 
60%, solved the deaths of both Captain 
Watts and his son James. Of the minor- 


.ity, 4095, some ascertained the correct 


cause of Captain Watts' death, and some 
ascertained the correct cause of James 
death. 


1. The Methods of Attack Used 

All except one method (Analysis into 
major variables) were found. 

(а) Clear formulation of the problem. 
Apparently the nature of the task was 
self-evident, for all of the subjects stated 
the problem-task correctly. Two subjects 
seemed to have formulated the problem 


more vividly by extreme fixation of the ` 


answer-possibilities: these two subjects re- 
read the entire paragraph while keeping 
in mind the three possible causes (mur- 
der, accident, and suicide). 

(b) Preliminary survey of all aspects 
of the presented material. Table g shows 
the distribution of the number of times 
the whole clues-paragraph was read. 
Thus, about half of the subjects did not 
read through the entire paragraph eve? 
a second time, All of the subjects, how 
ever, referred back to individual parts of 
the paragraph. 

(с) Locating a crucial aspect of thé 
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TABLE 3 9 bility of each of the three causes—mur- 

x = der, accident, suicide—to the death of 
No. of No. of : 

Readings Subjects Captain Watts and then to the death of 

James—to determine whether the cause 

ө could have been murder, could have 

3 . sit + 

Б been accident, could have been suicide,’ 

i we found that 28% (7 subjects) answered 

N=25 "Yes" and 72% (18 subjects) answered 


Mean — 1.8 readings 


problem. The question asking the sub- 
jects to list each item of information im- 
portant to solution involved every item 
except two: sentence (g) concerning a 
third man, and sentence (k) concerning 
James’ dislike of his father. Of the g 
important points, Table 4 shows the dis- 


TABLE 4 


No. of Important No. of 
Points Reported Subjects 


бан 


| own 


ы 
p 


Mean —6.7 points 
Stand. Dev.— 1.26 


tribution of the number of points ге- 
ported by the subjects. As an average, 
the subjects reported more than % of the 
important items of information. 

(d) Application of past experience. 
Only 3 subjects reported such applica- 
tion. The remarks are quoted: 

SUBJECT MEM: "Yes, like the fact that a 
man can't shoot himself in the back." 

SUBJECT SEC: "Yes, I have built up a habit 
of considering every statement." 


SUBJECT RMS: "Yes, the value of reviewing 
the facts by reading them again." 


(e) Varied trials. On the question “Did 
you try to deliberately apply the possi- 


"No." Thus only about 14 of the sub- 
jects felt that they had tried to consider 
the possibility of each of the three causes 
for each man's death. 

(f) Control. On the question’ “If you 
re-read the paragraph, did you single out 
each sentence to try to determine 
whether any of the various sentences 
made impossible one or two of the three 
possible causes" we found that 44% 
(11 subjects) answered "Yes" and 56% 
(14 subjects) answered “No.” Thus a 
little less than half the subjects reported 
the use of this particular method of 
control. 

(g) Elimination of sources of error. 
'The four questions testing for ability to 
detect and eliminate sources of error in- 
volved a total of 8 items in the para- 
graph. Table 5 shows the distribution of 
the number of points correctly reported 
by the subjects. Thus, of a maximum 
possible of 8 items, the subjects, as an 
average, detected half the items. 

(h) Visualization. 84%, (21 subjects) 
reported visualization of the persons and 
events in the story, while 16% (4 sub- 
jects) reported absence of visualization. 

The general question (concerning use 
of other methods) brought forth remarks 
by only three subjects. The first subject, 
on noticing that she was to solve a “de- 
tective story," said that she had a good 


1 Here, as in the other questions probing for 
use of methods, the particular question asked 
is, of course, only one way of searching for 


methods, in this case for control of individual 
clues. 
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63% of the errors were perceptual mistakes. We list here some examples: 


VBCDF WORSH XIJKF YLMNF ZABCF 

SUBJECT PHK: “Because the other groups have the three middle letters in alphabetical se- 
quence.” 

But so does the one she marked. 

LGBGM TBKAK ZDNEN FRLIC VPSPW 
SUBJECT KLJ: “Because the other four groups have one letter repeated.” 
But so does the one she marked. 

FGHLM CDFGH JKMNO STVWX PQSTU 


SUBJECT JAH: “Because this group is the 
only one with a vowel.” 

But note the last group. 

16% (3 errors) were the result of 
wrongly formulated problem-task and 
have been already listed and discussed 
above under “Clear formulation of the 
problem.” 


2. The Efficacy of the Methods Used 

Following are the correlations between 
the methods used and the number of 
problems worked correctly. 

(a) Clear formulation of the problem. 
Clear verbal formulation correlated (bi- 
serial correlation) —.10 with problems 
worked correctly, so individual differ- 
ences in verbal formulation in this ex- 
periment do not seem to be related to 
successful solution, Apparently subjects 
understood the task in spite of inability 
to state the task clearly and correctly. 

(b) Preliminary survey of all aspects 
of presented material. All subjects used 
this factor, so no comparative evidence 
was available concerning its efficacy. 

(c) Analysis into major variables. The 
number of major variables reported 
correlated (product-moment correlation) 
+.51 (Р.Е, = .11) with problems worked 
correctly, so detection of major variables 
seems to be moderately correlated (posi- 
tively) with successful solution. 


B. Tur DEDUCTION PROBLEM 
Following is the distribution of scores 
on the problem concerning the deaths of 


Captain Watts and his son James. 


Students 
Correct cause for both deaths: 15 (60%) 
Correct cause for one death: 10 (40%) 
Correct cause for neither death: о 


Given unlimited time, the majority, 
60%, solved the deaths of both Captain 
Watts and his son James. Of the minor- 


‚ ity, 40%, some ascertained the correct 


cause of Captain Watts' death, and some 
ascertained the correct cause of James’ 
death. 


1. The Methods of Attack Used 


All except one method (Analysis into 
major variables) were found. 

(a) Clear formulation of the problem. 
Apparently the nature of the task was 
self-evident, for all of the subjects stated 
the problem-task correctly. Two subjects 
seemed to have formulated the problem 


more vividly by extreme fixation of the ` 


answer-possibilities: these two subjects re- 
read the entire paragraph while keeping 
in mind the three possible causes (mur- 
der, accident, and suicide). 

(b) Preliminary survey of all aspects 
of the presented material. Table 3 shows 
the distribution of the number of times 
the whole clues-paragraph was read. 
Thus, about half of the subjects did not 
read through the entire paragraph even 
a second time. All of the subjects, how- 
ever, referred back to individual parts of 
the paragraph. 

(c) Locating a crucial aspect of the 
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TABLE 3 * 
No. of No. of 
Readings Subjects 
3 
8 
12 
N=25 


Mean =1.8 readings 


problem. The question asking the sub- 
jects to list each item of information im- 
portant to solution involved every item 
except two: sentence (g) concerning a 
third man, and sentence (k) concerning 
James dislike of his father. Of the 9 
important points, Table 4 shows the dis- 


TABLE 4 
No. of Important No. of 
Points Reported Subjects 
1 
5 
10 
5 
3 
o 
1 
25 


Mean =6.7 points 
Stand. Dev.=1.26 


tribution of the number of points re- 
ported by the subjects. As an average, 
the subjects reported more than 24 of the 
important items of information. 

(d) Application of past experience. 
Only 3 subjects reported such applica- 
tion. The remarks are quoted: 

SUBJECT MEM: "Yes, like the fact that a 
man can't shoot himself in the back." 

SUBJECT SEC: "Yes, I have built up a habit 
of considering every statement." 

SUBJECT RMS: “Yes, the value of reviewing 
the facts by reading them again.” 

(e) Varied trials. On the question “Did 
you try to deliberately apply the possi- 


bility of each of the three causes—mur- 
der, accident, suicide—to the death of 
Captain Watts—and then to the death of 
James—to determine whether the cause 
could have been murder, could have 
been accident, could have been suicide,” 
we found that 28% (7 subjects) answered 
"Yes" and 72% (18 subjects) answered 
"No." Thus only about 1⁄4 of the sub- 
jects felt that they had tried to consider 
the possibility of each of the three causes 
for each man’s death. 

(f) Control. On the question’ “If you 
re-read the paragraph, did you single out 
each sentence to try to determine 
whether any of the various sentences 
made impossible one or two of the three 
possible causes,” we found that 44% 
(11 subjects) answered "Yes" and 56% 
(i4 subjects) answered "No." Thus a 
little less than half the subjects reported 
the use of this particular method of 
control. 

(g) Elimination of sources of error. 
The four questions testing for ability to 
detect and eliminate sources of error in- 
volved a total of 8 items in the para- 
graph. Table 5 shows the distribution of 
the number of points correctly reported 
by the subjects. Thus, of a maximum 
possible of 8 items, the subjects, as an 
average, detected half the items. 

(h) Visualization. 84% (21 subjects) 
reported visualization of the persons and 
events in the story, while 16% (4 sub- 
jects) reported absence of visualization. 

The general question (concerning use 
of other methods) brought forth remarks 
by only three subjects. The first subject, 
on noticing that she was to solve a “de- 
tective story," said that she had a good 


1 Here, as in the other questions probing for 
use of methods, the particular question asked 
is, of course, only one way of searching for 


methods, in this case for control of individual 
ciues. 
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TABLE 5 
No. of Points No. of 
Reported Subjects 
3 
7 
8 
5 
I 
o 
T 
N=25 


Mean =4.1 points 
Stand. Dev.=1.32 


imagination. Her solution would be 
satisfactory except for ignoring the gun- 
powder-burns on the back of James’ 
coat: “Captain Watts accidentally shot 
himself in the chest. The bullet went 
through his body and hit James, stand- 
ing behind him, in the back.” - 

The remarks of the other two subjects 
(both of whom solved the problem cor- 
rectly) suggest, again, that subjects may 
not be aware of using methods and may 
not be aware of how they actually solve 
a problem: "Just reading and restating 
the points seemed to suggest solution," 
and "Reading the material suggested 
various possible solutions which I 
checked on." 

The only method not discovered was 
"Analysis into major variables." For this 
particular problem this method could 
refer to the three ways in which each 
man could have died (murder, accident, 
suicide); then the method would be 
synonymous with "Varied trials" re- 
ported on this problem. On the other 
hand, if the important clues are con- 
sidered as major variables, then this 
method would be synonymous with "Lo- 
cating a crucial aspect," reported on this 
problem. 

The range of the total number of 
methods used per subject was from 3 to 
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7, with the mean at 4.7 methods per 
subject. Of course this average conceals 
the qualitative differences in the usc of 
а given method by different subjects and 
also the quantitative difference in the 
extent to which a method was used. 


2. The Efficacy of the Methods Used 


The following tables and bi-serial cor- 
relations show the relationship between 
the use of a method and the success or 
failure in solving the two deaths. 

(a) Clear formulation of the problem. 
All subjects seemed to formulate the 
problem clearly, so no measure of 
efficacy was possible. It might be noted 
that the two subjects who formulated 
the problem quite vividly (by fixating 
the three answer-possibilities while re- 
reading the paragraph) both solved the 
problem. 

(b) Preliminary survey of all aspects 
of the presented material. The number 
of times the subject read the clues-para- 
graph correlated +.22 with successful 
solution, so complete re-survey seems 
slightly related to successful solution. 

(c) Locating a crucial aspect of the 
problem. 'The number of important 
points detected and reported correlated 
+.39 (P.E. = .15) with successful solu- 
tion, so detection of this kind of crucial 
aspect seems slightly related to successful 
solution. 

(d) Application of past experience. 
Only 3 subjects reported the direct appli- 
cation of past experience (2 solved the 
problem), so no measure of the efficacy 
of past experience was feasible. 

(e) Varied trials. In interpreting the 
following percentages (in Tables 6, 7, 
and 8) it must be noted that 60% of 
the entire group solved the problem. 
Therefore the percentage of users of the 
method “Varied trials" must be compared 
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TABLE 6 TABLE 8 
І Solved | Failed | Solved. | Failed. 
(by 60% | (by 40% | (by 60% | (by 40% 
of the of the | of the of the 
subjects) | subjects) | subjects) | subjects) 


7 subjects who tested 
three possible causes 
18 subjects not testing 
three possible causes 


5 (71%) | 2 (29%) 
то (56%) | 8 (44%) 


21 subjects who reported 


visualization i 12 (5790) | 9 (43%) 
4 subjects not reporting 
visualization 3 (75%) | 1 (2590) 


with 60%, not with 50% (as would be 
the case if equal numbers solved and 
failed the problem). Of the 7 subjects who 
tested each of the three possible causes, 
71% solved the problem, 1 1% more than 
might be expected, so (our measure of) 
variation seems slightly related to success- 
ful solution. 


TABLE 7 
— Solved | Failed 
(by 60% | (by 40% 
of the of the 


subjects) | subjects) 


11 subjects who tested the 


effect of isolated Hn 9 (82%) | 2 (1890) 
14 subjects not testing the i 
effect of isolated items | 6 (43%) | 8 (57%) 


(f) Control. Of the 11 subjects who 
isolated items and tested their effects, 
82% solved the problem, 22% more than 
might be expected, so this measure of 
control seems somewhat related to suc- 
cessful solution. 

(g) Elimination of sources of error. 
The number of items detected concern- 
ing sources of error correlated +.60 
(P.E. = лә) with successful solution, so 
our measure of detection and elimina- 
tion of sources of error seems to be mod- 
erately and positively correlated with 
successful solution. : 

(h) Visualization. Of the 21 subjects 
who reported visualization of the per- 
sons and events in the story, 57% solved 


the problem, 3% less than might be ex- 
pected, so visualization does not seem 
related to successful solution. 

The total number of methods used 
correlated +.48 (P.E. =.14) with success- 
ful solution, and thus was moderately 
and positively correlated with success in 
this problem. 

Thus, four methods seem somewhat re- 
latd to successful solution of this par- 
ticular problem: 

Locating a crucial aspect 

Varied trials 

Control 

Elimination of sources of error 


C. THe GEOMETRICAL-FIGURE PROBLEM 


Table 9 shows the distribution of the 
number of trials required to solve the 
problem, and includes the 6 subjects who 
failed to solve the problem in the 14 
trials allowed. 


1. The Methods of Attack Used 


This tracing-puzzle is so constructed 
that, by starting at either one of the two 
end-points of the inner diagonal, solu- 
tion is easy. But starting at any other of 
the 14 remaining points makes solution 
impossible. Of course it is possible that 
a subject will happen to start at one of 
the two correct starting-points; one sub- 
ject did start thus and was surprised at 
his early solution, and his paper was 
excluded from our results since no infor- 


who asked Whether they might erase 
their last line drawn; the directions spe- 
cified ". . . do not erase or change what 
you have drawn...” 

(b) Preliminary survey of all aspects of 
the presented material. 38% of the sub- 
jects reported that they had studied the 
puzzle before beginning to draw. In one 
case a subject pondered for half a minute 
before beginning the first trial. On this 
particular problem, the time spent in 
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TABLE 9 
5 No. of 
Trials Subjects 
4 
4 
I 
2 
I 
1 
I 
2 
2 
6 
N24 


Median =8 trials 


mation concerning work-methods was 
available. Of the nine methods of attack, 
only "Application of past experience" 
was not reported. 

(a) Clear formulation of the problem. 
АП of the subjects seemed to have formu- 
lated the problem clearly; apparently the 
task in this puzzle is self-evident, from 
the nature of the instructions employed. 
Although many subjects started (and 
tended to continue starting) at the upper 
left-hand point, they had not assumed 
that the directions required that they 
start at this corner; possibly the deeply 
ingrained reading-habit of starting at up- 
per-left may underlie this phenomenon 
(Lindley, 10). One special aspect of the 
problem-task, however, was found to be 
unclearly formulated by a few subjects, 


preliminary survey was very short; sub- 
jects tended to gain familiarity with the 
problem through overt trial-and-error 
rather than through close inspection or 
implicit (imagined) trial-and-error. 

(c) Analysis into major variables. 
Since starting at either of the two correct 
starting-points makes solution rather 
easy, there is only one major variable: 
correct starting-point, However, since it 
is possible to go into a blind-alley even 
after starting correctly, the avoidance of 
blind-alleys may be considered as a 
minor variable. The subjects reporting 
the major variable all achieved the same 
general idea: starting at the center (“it 
seemed best to start at the center and 
work around”). However, the minor 
variable was not so specifically formu- 
lated. Although most of the subjects 
who reported it simply warned against 
blind-alleys, others were more specific 


FIGURE 1. Geometrical-figure problem. 


Draw this figure, without going over any line already drawn and w 
paper. Fill in as many lines as you can. If you are unable to comp 
change what you have drawn, but start over again on a new 


complete figure successfully. 


ithout taking the pencil off the 
lete the figure, do not erase or 
figure. Continue until you draw a 


| 


її" 


di 


| 
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("Leave an open space near a partially 
drawn section to get back in later to com- 
plete the section." And another: "Don't 
start at any of the 4 outside corners."). 
17% of the subjects reported the major 
variable. 38% reported the minor vari- 
able (blind-alleys). And none reported 
both. 

(d) Locating a crucial aspect of the 
problem. Some subjects hit upon a “false 
crucial aspect"; they selected one point 
for starting and felt that solution de- 
pended only upon the care exercised 
after starting. Another false "crucial 
aspect" or "direction" consisted of de- 
tecting patterns: These subjects noticed 
the large octagon-pattern and tried to 
make it first, which necessitated starting 
at a point which was (unknown to the 
subject) detrimental to solution. Finally, 
some subjects conceived the idea of start- 
ing in the center, "to work around"; 
since two of the four center-points were 
correct starting-points, this was a partial 
localization. of the problem, and the 
subjects soon solved the puzzle. 50% of 
the subjects reported some "crucial" as- 
pect (including false ones). . 

(e) Application of past expertence. Al- 
though there were а few subjects who 
had worked a similar kind of puzzle in 
the past, none reported the applica- 
tion of any specific principle carried 
over from previous related experiences. 
Again, however, the criterion (verbaliza- 
tion) may have obscured minor aspects of 
carryover. Theoretically, the one great 
application that might have been made 
would be the notion of trying different 
starting points. Inspection reveals, how- 
ever, that many "similar" puzzles are so 
constructed that, by starting at any point 
and exercising care, one can solve the 
puzzle; if this is true of the geometrical- 
figure puzzles previously tried by the 


subjects, then no positive carryover of 
the major variable (correct starting 
point) in our puzzle could be expected. 
On the contrary, we should expect a dis- 
regard of the starting-point as a negative 
transfer to this problem. As reported 
below, under “Varied trials" and "Con- 
trol," many of the subjects started con- 
secutive trials from the same point, thus 
confirming the possibility of negative 
transfer from previous puzzles in which 
starting-point may have been unim- 
portant, This finding also suggests that 
the efficacy of methods of attack may not 
be general but conditional, depending 
upon the degree to which any given 
method makes clear the crucial major 
variable or variables basic to the prob- 
lem at hand. A similar finding is reported 
under efficacy of the factors of “Varied 
trials” and “Control.” 

(f) Varied trials, (g) Control. As ex- 
perimentally defined, "control" (‘“varia- 
tion, isolation, repetition”) has been 
broken down into two factors for this 
problem: (f) Varied trials (“variation” 
and (g) Control ("isolation, repetition"). 

Our only expedient measure of 
“Varied trials" was: the number of dif- 
ferent starting-points used, divided by 
the number of trials required for solu- 
tion. Thus, the highest possible variabil- 
ity would be 100% for a subject who 
varied his starting-point on each new 
trial. The lowest possible variability 
would be 7% (really representing no 
variability), for a subject who tried only 
one starting-point in 14 trials (1 — 14). 
It should be noted, however, that this 
measure of variability refers to variabil- 
ity of starting-point and that there re- 
mains the complex variability displayed 
after starting. This latter variability, im- 
portant no doubt to the subject, did not 
permit measurement; thus we were un- 
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able to establish a general measure of 
variability. 

Our measure of "Control" was: Total 
number of trials in which the same start- 
ing-point was used two or more times 
consecutively, divided by the number of 
trials. Thus, a subject who started at one 
point for three trials, then at another 
point for two consecutive trials, and then 
at another point for, say, five successive 
trials, has demonstrated 100% control 
(3 + 2 +s, divided by 10), in the sense 
that each new starting-point has been 
tried out for ? or more trials before being 
abandoned. A subject who varied his 
starting-point on each new trial would 
demonstrate zero control (and, it should 
be noted, a variability of 100%). Thus, 
by definition for our limited means ¿of 
measurement, high variability tended to 
go with low control (for this problem). 
However, since variation is an important 
aspect of control in the latter's full mean- 
ing and use, we have here considered our 
two measures as two aspects of the same 
principal factor. 

The results showed “Variability” rang- 
ing from 14% (only 2 starting-points in 
14 trials) to 100% (different starting- 
point on each trial), with the mean at 
56%. “Control” ranged from 09% 
(changed starting-point on each trial) to 
100% (each new starting-point tried for 
2 or more consecutive trials), with the 
mean at 51%. Two interesting examples 
of introspections are quoted, as indicat- 
ing personal differences among subjects 
in carrying out empirically the sensed 
awareness of the efficacy of variation: 

SUBJECT ALG (Variability 36%; Control 
43%): On starting her sixth trial at a new point, 
said, "Variety is the spice of life, so I'll start 
here." On starting her last possible trial (14th), 
she said, "Maybe if I start in the middle, some- 


thing might happen." She started at a correct 
point and solved the puzzle. 


SUBJECT RMS (Variability 14%; Control 
93%): This subject began her second trial at a 
different point, but continued starting at this 
point right through the next 19 trials, un- 
successfully as regards the solution. While work- 
ing on her seventh trial, she said, "Maybe if 
I start at a different place, I'd make it.” But she 
started her eighth trial at the same point again. 
On starting her tenth trial at still the same 
point, she said again, “Maybe I don't start at 
the right place.” But she began her eleventh trial 
at the same point again, as with the twelfth, 
thirteenth, and fourteenth (last). 


Thus, for our measures in this prob- 
lem "Control" seems to measure mainly 
persistence, with most of the variability 
partialled out and measured as “Varied 
trials." 

(h) Elimination of sources of error. 
42% of the subjects reported the identi- 
fication and elimination of one source of 
error, mainly: blind-alleys (the minor 
variable). An unusual source of error was 
encountered, and met, by one subject 
who on her second trial unknowingly 
omitted the center diagonal, finished the 
rest of the figure, and thought that she 
had solved the puzzle until reminded (by 
the experimenter) of her omission. To 
meet this source of error, before begin- 
ning her next trial she marked light 
crosses in the five places where diagonals 
had to be drawn; she encountered no 
difficulty thereafter. This, of course, also 
represented “Locating a (minor) crucial 
aspect.” 

(i) Visualization. The maximum num- 
ber of possible-next-moves visualized by 
the subjects ranged from 2 to 7, with the 
mcan at 4.4 moves. 

No other methods were revealed in the 
answers to the general question. 

The range of the total number of 
methods used per subject was from 3 to 
8, with the mean at 5.6 methods per sub- 
Ject. Again, we recognize that the mean 
of 5.6 blankets the fact that a given 
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method differed qualitatively (and also 
in quantitative degree) from one subject 
to another on the same problem. 


з. The Efficacy of the Methods Used 


The following tables show the relation 
between the use of a method and the 
number of trials involved in solution. 
The number of trials are presented in 
a two-fold classification: Superior subjects 
(below the median number of trials) and 
Inferior subjects (above the median 
number of trials). These small numbers 
of cases derived from a study mainly 
qualitative in design do not permit rigor- 
ous statistical treatment, but inspection 
of the results grouped into the 2 cate- 
gories listed reveals general tendencies 
toward efficacy or поп-ећсасу of various 
methods. 

For 3 methods where a rank-difference 
correlation was feasible (although no 
measures of reliability were obtainable), 
the correlation is presented. 

(a) Clear formulation of the problem. 
All subjects seemed to formulate the 
problem clearly, so no measure of efficacy 
was possible. 

(b) Preliminary survey of all aspects of 
the presented material. Inspection reveals 
that among the subjects who reported pre- 
liminary survey there were about equal 
numbers of superior and inferior subjects. 


TABLE то 


Number of Trials 
Involved in Solution 


12 Supe- | 12 Infe- 
rior rior 
Subjects | Subjects 
(3 to 7 |(9 trials to 
trials) | “failed’’) 


And among those not reporting prelimin- 
ary survey, there were also about equal 
numbers of superior and inferior subjects. 
A preliminary survey of all aspects of the 
stimulus-material (in the minor degree 
to which it appeared at all) did not seem 
helpful for solution of this particular 
problem. 

(c) Analysis into major variables. The 
majority (5 out of 4) of those reporting 


TABLE rr 


Number of Trials 
Involved in Solution 


12 12 
Superior | Inferior 
Subjects | Subjects 


Detected major variable 


(N=4) 3 І 
Dècected minor variable 

(N—9) 3 6 
Detected neither variable 

(N=11) 6 5 


the major variable were superior sub- 
jects. More inferior subjects reported the 
minor variable (unimportant in this 
problem), and about equal numbers of 
superior and inferior subjects reported 
neither variable. Awareness of the major 
variable seems to be related positively 
to successful solution. 

(d) Locating a crucial aspect of the 
problem. Of those who had an hypothe- 
sis which they considered crucial, there 
were more inferior subjects. And of those 
lacking such an hypothesis there were 


TABLE 12 


Number of Trials 
Involved in Solution 


12 Superior] 12 Inferior 


Subjects making prelimi- Subjects | Subjects 
nary survey (N =o). , 5 4 gas i 

Subjects omitting prelimi- : Hypothesis (N= 12) 5 ; 
nary survey (N = rs) 7 8 No hypothesis (N = 12) 7 | 5 
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more superior subjects. Mere presence of 
an hypothesis concerning a "crucial" 
aspect seems, if anything at all, nega- 
tively related to successful solution for 
this particular problem. It is probably a 
matter, not of having an hypothesis but 
of quality of the hypothesis, since a poor 
hypothesis is not merely indifferent but 
actually detrimental to the speed of 
solution on the geometrical-figure prob- 
lem. 

(e) Application of past experience. No 
subjects reported the direct application 
of specific principles which could be 
tabulated. 

(f) Varied trials. Variability correlated 
+.87 (Р.Е. = .07) with successful solu- 
tion, so varied trials seem to be highly 
related (positively) to successful solution, 
because our specific measure of variabil- 
ity (changed starting-point) eventually 
led to the correct starting-point. 

(g) Control. Control correlated —.61 
(P.E. — .11) with successful solution, so 
(our measure of) control seems to be 
moderately related (negatively) to suc- 
cessful solution, because isolation and 
repetition of same starting-point pre- 
vented the subject from trying the only 
two correct starting-points. 

(h) Elimination of sources of error. Of 
those reporting a source of error, there 
were more inferior subjects than superior 
ones. And of those not reporting a source 


TABLE :3 


Number of Trials 
Involved in Solution 


12 12 
Superior | Inferior 
Subjects | Subjects 


Detection of source of 


error (N= 10) 4 6 
No detection of source of 
error (N —14) 8 6 


of error, there were more superior sub- 
jects than inferior ones. In stating that 
elimination of a source of error was not 
related. positively to successful solution, 
it must be remembered that these errors 
(mainly blind-alleys) could not prevent 
failure based almost entirely on a single 
factor, namely: starting-point, a peculiar- 
ity of this particular problem. 

(i) Visualization. The number of vis- 
ualized  possible-next-moves | correlated 
+.37 (P.E. = аз) with successful solu- 
tion, so such visualization seems to be 
slightly related (positively) to successful 
solution. 

The total number of methods used 
correlated —.30 (P.E. = 14) with suc- 
cessful solution, and seems slightly re- 
lated negatively, apparently because 
some methods hindered solution. So the 
mere use of more methods does not seem 
to guarantee solution. 

Thus, four methods show some rela- 
tionship to successful solution of this 
problem: (a) Analysis into major vari- 
ables, (b) Locating a crucial aspect, (c) 
Varied trials, and (d) Control. The na- 
ture of the relation (positive or negative) 
reveals that detection of the only im- 
portant variable (correct starting-point), 
by means of varied trials rather than by 
persistence at the same starting-point 
("Control"), leads to rapid solution of 
this puzzle. In Summary, although the 
various methods of attack may be help- 
ful in other problems, only "varied 
trials” and "Analysis into major vari- 
ables" seem to be efficacious for solution 
of this problem, because the first leads to 
discovery of the second, and the second 
guarantees solution. The very construc- 
tion of the figure, offering only 2 correct 
starting-points out of 16 possible starting- 
points, seems to render other methods 
useless. 


pe ` 


ini 


IV. DISCUSSION 


IB INTERPRETING ^the results of this ex- 
periment, certain limitations and 
sources of error must be borne in mind. 


1. The particular question or technique we 
used to determine use of a given method of 
attack was only one way of determining use of 
that method. 

2. The particular question or technique em- 
ployed by us often required a dichotomous 
answer, thus permitting only gross quantification 
of the use of some methods. 

3. There are probably other methods of attack 
besides the ones for which we searched. None 
Was revealed in our investigation. 

4. Methods may be used but not reported. 
And, contrariwise reported use of a method does 
not guarantee that it was used by the subject. 
Our experimental technique revealed itself to 
be too dependent upon introspective report. 

5. In interpreting the absence of efficacy of 
certain methods of attack, it is important to 
distinguish between the methods reported by 
untrained individuals and the possibility of ap- 
plication of these methods by trained problem- 
solvers, 

6. Besides the various methods of attack used, 
other intellectual factors are of course partly 
basic to successful solution of these problems, 
factors not appearing as methods but contribut- 
ing nevertheless to solution. 

7. We used a highly select sample of subjects. 

8. The three problems which we used, al- 
though different in nature, меге hardly rep- 
resentative of typical problems. 


In the following interpretations of the 
results of the experiment, Table 14 will 
be used for the first two questions, and 
Table 15 for the third. In Table 14 the 
first 3 columns show the average degree 
ОЁ use (based on a maximum possible use 
of 100%) of each method, by the group, 
for each of the three problems; the miss- 
ing entries indicate that no measure was 
obtainable for certain methods in certain 
problems. The fourth column shows the 
average degree of use for three problems, 
and the last column shows the average 
degree of use for only those problems 
in which a measure of the method was 
Obtainable. 


1. How did individuals differ in the 
methods they used to solve the problems? 

An obscuring factor in the presenta- 
tion of the comparison of individuals 
with regard to use or non-use of various 
methods lies in the fact that the indi- 
viduals differed, not so much in the kind 
of methods used, but rather in the ex- 
tent to which each used a particular 
method. The individual differences in 
extent of use of each method for each 
problem have been indicated in the 
body of this paper; in summary it can be 
stated that extent of use for any one 
method for any one problem appeared to 
be distributed normally, considering the 
limitations of measurement (gross meas- 
urement, dichotomous answers, etc.). If 
we credit an individual with use of a 
method, regardless of degree of use, the 
differences in the number of methods 
used on each problem only approximate 
a normal curve; since each of the three 
distributions represents, not a single vari- 
able, but rather a total use of any or 
all of 9 variables (methods), we would 
not necessarily expect normal distribu- 
tions. 

a. In the induction problem, where 
only 3 methods were used, every subject 
used either э or 3 methods, with the 
mean at 2.4 methods per subject. 

b. In the deduction problem, 
where 8 methods were used, subject used 
from 3 to 7 methods, with the mean at 
4.7 methods per subject. 

c. In the geometricalfigure prob- 
lem, where 8 methods were used, subject 
used from 3 to 8 methods, with the mean 
at 5.6 methods per subject. 

з. To what extent were the methods 
used in different problems? 
In Table 14, the fourth column of fig- 
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TABLE 14 


AVERAGE EXTENT ОЕ USE or METHODS 
(All figures based on maximum possible or roo% use) 


ë ; Geometri- For For 
Method inductum DEE ion cal Three | Relevant 
Figure | Problems* | Problems* 
Clear formulation 
problem 40% 100% 100% 80% 80% 
Preliminary survey 100% 45% 38% 6т% 61% 
Analysis into major 70%" Same as “crucial 17% 29% 43.5% 
variables aspect” or 
"varied trials" 
Locating a crucial aspect | Not relevant? 749% 50% 419, 62% 
Application of past o% 12% o% 4% 12% 
experience 
Varied trials Same as “major 28% 56% 28% 42% 
variables"? 
Control Same as “major 44% 51% 32% 47.5% 
variables"? 
Elimination of sources of | Not relevant? 5190 42% 3196 46.596 
error 
Visualization Not relevant? 84% 159° 33% 49.5% _ 
Total number of methods 3 8 8 
used on problem 
Mean-number of methods 2.4 4.7 5.6 
used per subject 
Range of number of 2-3 3-7 3-8 


methods used by sub- 
jects on each problem 


a Based on maximum of 8 major variables. 


b Based on maximum of 4 readings (highest number by any subject), 


° Based on maximum of o crucial aspects. 
4 Based on maximum of 8 sources of error. 


* Based on maximum of 29 possible visualized next-moves. 
* These figures are intended as suggestive only, and have not been corrected for small differences in 


the number of subjects in each of the three groups. 


ures shows that some methods seem hardly 
to have been used at all. For example, 
"Application of past experience" seems 
to be almost completely absent (4%), and 
another 5 methods show use of only 28% 
to 33%. Inspection reveals, however, that 
"Application of past experience" was 
reported (by one or more subjects) on 
only one of the three problems; when 
the figure is based on the problem where 


the method was used, we see that 12% 
of the subjects reported its use. Further 
inspection reveals that the problem was 
a human relations situation (death of 
two men), in which past experience 
would more likely be applicable, as com- 
pared with the two problems of a more 
perceptual nature. Even in these, how- 
ever, it is the author's impression that 
finer techniques of investigation might 
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have revealed subtle use of;the methods 
in question. 

For the 5 methods showing use of only 
28% to 33%, inspection reveals that each 
method was measurable on only two 
problems; when the figures are based on 
the two problems where used, the range 
becomes 42% to 49.5%. 

Why were measures unobtainable for 
certain methods on certain problems? 
Two reasons are suggested. 


а. For a given problem, certain presumably 
different methods appeared (to both the ex- 
perimenter and the subjects) to carry the same 
meaning. On the induction problem, for ex- 
ample ried trials" and "Control" (two aspects 
of trial-and-error) consisted of applying generali- 
zations to the groups of letters; this method was 
measured as "Analysis into major variables." On 
the deduction problem, "Analysis into major 
variables" could be considered as looking for 
clues, measured as "Locating a crucial aspect," 
ог as questioning cach of the three possible 
causes (murder, accident, suicide) in relation to 
the death of cach man, measured as “Varied 
trials.” It matters little whether the subject’s 
behavior is classified under one or the other of 
two headings; the important thing is that a 
distinction should be made between “methods” 
different in terminology only and methods dif- 
ferent in actual behavior of the subjects. 

b. A similar point is that, on a given prob- 
lem certain "methods" seem either meaningless 
or irrelevant, On the induction problem, for 
example, the experimenter was unaware of any 
"crucial aspects" among the groups of letters: 
if the common characteristics are crucial, then 
this method is identical with "Analysis into 
major variables." On the same problem the 
experimenter was unaware of sources of error 
in the stimulus-material; and “Visualization” 
seemed to be irrelevant, in the sense that all 
the material was visual and seemed to require no 
additional visualization (imagery). The intro- 
spective reports confirmed the absence of such 
"methods." 


We conclude, therefore, that although 
there is wide variation in the extent to 
which different methods are used, extent 
of use is partly a function of relevancy of 
certain methods to certain problems. In 
Table 14, for example, inspection reveals 
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that only a few methods (3) are used on 
the induction problem, whereas most of 
the methods (8) are used on the other 
two problems; and the average use per 
subject follows the total number of 
methods used. Since the individuals do 
not change, we suggest that the number 
of methods used in different problem- 
situations depends upon the kind of 
problem involved: (а) some problems 
“permit” use of a given method (for ex- 
ample, application of past experience) 
while other problems do not; (b) in cer- 
tain situations a given “method of at- 
tack” becomes identical in meaning with 
a differently worded method of attack; 
and (c) in certain situations a given 
"method of attack" is a term rather than 
a form of human behavior. 

у. To what extent were the different 
methods efficacious? 

Table 15 summarizes the degree of ef- 
ficacy for each method on each problem. 

a. Clear formulation of the prob- 
lem. showed no relationship to solution 
for one problem and was of unknown 
efficacy for the other two problems. It is 
suggested that the task may have been 
quite obvious in our problems, thereby 
permitting no individual differences in 
understanding of the requirements of 
the problem. 

b. Preliminary survey of all aspects 
of the presented material was of slight 
help in the deduction problem, of un- 
known efficacy in the induction problem, 
and of no efficacy in the geometrical- 
figure problem, It is suggested that suc- 
cess in the latter two problems requires 
considerable trial-and-error rather than 
global survey. 

c. Analysis into major variables was 
moderately helpful in the induction 
problem, slightly helpful in the geo- 
metrical-figure problem, and unmeasura- 
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TABLE rs ع‎ 
EFFICACY OF METHODS OF ATTACK 


Correlations indicated when obtainable. Degree of relationship (‘‘Zero" or "slight") indicated 
where correlations could not be derived. Relationships not obtainable for: (a) methods 
not measurable and (b) methods showing no substantial individual differences. 


Method 


Induction Problem 


Deduction Problem | Geometrical Figure 


Clear formulation of problem —. ro 


Used by all: efficacy | Used by all: efficacy 
not obtainable not obtainable 


Preliminary survey 


Used by all: efficacy +.22 Zero 
not obtainable 


Analysis into major variables +.51 
(P.E.=.rr) 


Slightly positive 


Locating a crucial aspect 


PE 9 Slightly negative 


Application of past experi- | Used by none 
ence 


Used by only three: Used | 
efficacy noe быы о BORE 
able 


Varied trials 


Slightly positive +.87 
(P.E.= .o7) 


Control 


Slightly positive —.6r 
(P.E.=..rr) 


Elimination of sources of 5 Е 
еггог eE = Slightly negative 

Visualization Zero "esr 
(Р.Е.= .r3) 


Total number of methods | Number of methods 
used insufficient to per- 


mit correlation 


—.30 
(P.E.= .r4) 


ble in the deduction problem. It is sug- 
gested that this method tends to be help- 
ful because it may expose a variable 
fundamental to solution (along with un- 
important variables). 

d. Locating a crucial aspect was 
slightly helpful in the deduction prob- 
lem, slightly disadvantageous in the geo- 
metrical-figure problem, and unmeasura- 
ble in the induction problem. It is sug- 
gested that the potential efficacy of the 
search for crucial aspects is contingent 
upon the time or trials available to the 
problem-solver: with unlimited time (as 
in the deduction problem), the method 
tends to be helpful; with limited time 
or trials (as in the geometrical-figure 


problem), the method may obstruct solu- 
Чоп, especially if one hypothesis be per- 
severated (Maier’s “false direction"). 

€. Application of past experience 
Was Not reported by our subjects, so no 
measure of efficacy was obtainable. On 
the Seometrical-figure problem, however, 
the work-sheets indicated a tendency to 
Start at the same point (usually the up- 
per-left) on consecutive trials, thus mak- 
Ing solution impossible. It is suggested 
that previous practice in tracing figures 
acts às negative transference, because 
most tracing-figures can be solved from 
any starting-point, so long as care is ex- 
ercised to avoid blind-alleys. 

f. Varied trials was slightly helpful 


AD qe 


— ү 
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in the deduction: problem, paramount in 
the geometrical-figure problem, and un- 
measurable in the induction problem. 
Recognizing, however, that in the geo- 
metricalfigure problem our only meas- 
ure of variation was variation of starting 
point, and that correct starting-point is 
the only important factor for this partic- 
ular problem, it is suggested that the 
efficacy of variation is dependent upon 
its potentiality of exposing the major 
variable or crucial aspect of a problem. 

g. Control was slightly helpful in 
the deduction problem, detrimental in 
the geometrical-figure problem, and un- 
measurable in the induction problem. It 
is suggested that the efficacy of "isolation 
and repetition" of variables felt to be im- 
portant by the subject, or the efficacy of 
aspects felt to be crucial, is dependent 
upon the subject's luck or skill in test- 
ing the important variable or aspect. 

h. Elimination of sources of error 
was moderately helpful in the deduction 
problem, slightly disadvantageous in the 
geometrical-figure problem, and unmeas- 
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urable in the induction problem. It is 
suggested that the efficacy of this method 
is dependent upon its potentiality of ex- 
posing significant features of the situa- 
tion. 

i. Visualization showed а small 
positive correlation with successful solu- 
tion in the geometrical-figure problem, 
no effect in the deduction problem, and 
was unmeasurable in the induction prob- 
lem. Visualization could be of no help 
on the geometrical-figure problem unless 
a subject could visualize all 29 moves, 
because 2 of the 16 starting-points prac- 
tically guarantee solution, whereas 14 
starting-points make solution impossible 
with or without visualization. The small 
positive correlation in the geometrical- 
figure problem may indicate the typical 
ро itive correlation of abilities rather 
than a causal relationship. It is suggested 
that, as with several other methods al- 
ready discussed, the efficacy of visualiza- 
tion (and other imagery) is dependent 
upon the type of problem. 


V. SUMMARY AND CONCLUSIONS 


HE LITERATURE on reasoning reveals 
ii ipei methods, techniques and 
auxiliary aids used by subjects during 
problem-solving. Nine of such known 
"methods of attack" were selected for 
investigation in this experiment. 

1. Clear formulation of the problem. 

2. Preliminary survey of all aspects of 
the presented material. 

8. Analysis into major variables. 

4. Locating a crucial aspect of the 
problem. 

- Application of past experience. 
Varied trials. 

Control. 

Elimination of sources of error. 

9. Visualization. 

Three different problems were selected 
to be given to the subjects serving in this 
experiment, The induction problem was 
a short concept-formation test (identifica- 
tion of similarities and differences in 
letter-grouping series) devised by Pro- 
fessor Thurstone; 19 items were used. 
The deduction problem, chosen from 
one of Professor Thurstone's reasoning 
tests, presented a paragraph of clues to 
the death of two men; the subject is to 
ascertain the cause of the death of each 
man. The general problem was a geo- 
metrical-figure problem of a maze-tracing 
variety. The problems were administered 
individually to a group of 25 under- 
graduate students, with an unlimited 
time-allowance. A number of specific 
questions were asked of the subjects dur- 
ing and after the work-period, and ob- 
jective records of the work-sheets were 
kept, designed to determine the use of 
the various methods of attack selected 
for investigation. 

Our problem was to determine the 
nature and efficacy of the methods of at- 
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tack used on different problems. The 
three main questions, and the results, 
were às follows. 

1. How did individuals differ in the 
methods they used to solve the problems? 

а. If we credit an individual with 
use of a method, disregarding degree of 
use, the number of methods used was 
not distributed normally for either all 
three problems combined or for any onc 
problem. It appears to us that normal 
distributions would not necessarily be 
expected, considering that each distribu- 
tion represents not a single variable or 
continuum, but rather a total use of any 
or all of nine variables (methods). 

b. In the induction problem, every 
subject used either two or three methods, 
with the mean at 2.4 methods. In the 
deduction problem, subjects used from 
3 to 7 methods, with the mean at 4.7 
methods. In the geometrical-figure prob- 
lem, subjects used from 3 to 8 methods, 
with the mean at 5.6 methods. 

c. Individual differences in degrce 
of use of a particular method on a given 
problem appeared to be distributed 
normally, considering the limitations of 
measurement. 

2. To what extent were the methods 
used in different problems? 

There was wide variation in the extent 
to which different methods were used, 
certain methods being used on one prob- 
lem, and other methods on a different 
problem. Two methods were used on all 
three problems, six methods were used 
on two problems, and one method was 
used on only one of the three problems. 
Examination of the tasks required in the 
three problems suggests that, for a spe- 
cific problem: (а) certain presumably- 
different methods appeared to carry the 
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same meaning, and (b) секаіп so-called 
"methods" appeared to be meaningless 
or irrelevant. 

3. To what extent were the different 
methods efficacious? 

a. For a given problem 
methods were efficacious, some were in- 
different, and some were detrimental to 
solution. The latter case seemed to occur 
whenever a particular method served to 
prevent or postpone discovery of the 
variable or aspect fundamental to solu- 
tion of the problem. 

b. Because no method was used on 
all three problems (except “Clear formu- 
lation," used by almost everyone, and 
thus of unmeasurable efficacy), efficacy of 
a given method was demonstrable for 
only one or two of the three problems. 
For the latter case (two problems), two 
methods were efficacious оп both prob- 
lems, two methods were efficacious on 
one problem and “indifferent” (zero cor- 
relation) on the other, and three methods 
were efficacious on one problem and 
"detrimental" (negative correlation) on 
the other. 

In interpreting the results, however, 
certain limitations and sources of error 


some 


were recognized: 


1. The particular question or technique used 
to determine use of a given method was only one 
way of determining use of that method. 

2. The particular question or technique em- 
ployed by us often required a dichotomous 
answer, thus permitting only gross quantifica- 
tion of the use of methods. 

3. There were probably other methods used 
which escaped our tools of search. 

4. Some of the nine methods may have been 
used but not reported and not demonstrated in 
the work-sheets. 

5. Various uncontrolled intellectual factors 
were also contributing to successful solution. 

6. We used a highly select sample of subjects. 

7. The three problems which we used, al- 
though different in nature, were hardly rep- 
resentative of typical problems. 


Four general conclusions are suggested 
as a result of this experiment. 

1. The extent of use of a particular 
method of attack varies with different 
problems and is dependent upon the 
Кіа of task involved. 

2. On a particular problem, certain 
presumably different methods may be 
identical. 

з. On a particular problem, certain 
so-called “methods” may be meaningless: 
mere terms having no meaningful refer- 
ence to possible behavior. 

4. The potential efficacy of a particu- 
lar method of attack depends upon the 
kind of problem involved. 
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THE VALIDITY OF A MULTIPLE-CHOICE PROJECTIVE TEST 
IN PSYCHOPATHOLOGICAL SCREENING! 


I. INTRODUCTION 


HERE is marked need for a test which 
Te be rapidly administered and 
scored, and which has sufficient validity 
for use as a screening device in the de- 
tection of psychopathology. The present 
study is an attempt to determine whether 
the projective test approach? may be util- 
ized for this purpose. 


In the past decade, more and more clinical 
psychologists have turned to the use of projec- 
tive tests in their attempts to identify psycho- 
pathology. Most of the projective tests have been 
standardized under conditions of individual ad- 
ministration and free association responses. Such 
tests have not been suitable as screening instru- 
ments since a great deal of time is required 
by a highly skilled psychologist to administer 
and interpret the results for each. subject. How- 
ever, there have been some projective tests in 
which the subjects selected their responses from 
several multiple choices listed, permitting rapid 
and objective scoring by untrained personnel? 
The most widely used test of this type has been 
the Harrower Multiple Choice Rorschach (19). 


‘Sponsored by the Veterans. Administration 
and published with the approval of the Chief 
Medical Director. The statements and conclu- 
sions published by the author are a result of his 
own study and do not necessarily reflect the 
opinion or policy of the Veterans Administra- 
tion. -- 

? Comprehensive discussions of the projective 
methods may be found in Frank (1:2), Sargent 
(46) and White (57). Each of a series of reviews 
by Symonds and Samuel (49); Symonds m 
Krugman (48), and Hertz, Ellis and Symonds 
(21) has summarized the literature published 
since the preceding review. Briefly, the projec 
tive test approach is defined as one in ын 
the subject is required to structure an ambigu- 
ous stimulus. In doing so, he З presumed to 
reveal his own personality structure. 

There Wave SAT bs some attempts to 
quantify tests calling for free аа 
sponses, by organizing the responses E о Te 
categories which are scored linearly; G2 ae 
Rorschach Inspection Technique as on By 
Munroe (37) and modified by Ross an toss 
(45). It is felt that the validity of such teste as 
Screening devices should be investigated LA 
completely objective scoring system has 
proven inadequate. 


Inkblots were also employed as stimuli by Ter- 
man (52) and by Harrower and Grinker (4, 17). 
Pictures were used as stimuli by Clark (5), 
Franck (11) and Horowicz (23) Murray and 
Stein (38), Szondi (7, 41) and Temple and 
Amen (8, 51). Single words served as stimuli 
for Crown (6), Malamud (31), Maller (34%, 
Mittelman and Brodman (36) and Pressey (39, 
40), while story plots were used by Kelly and 
Bishop (27), Taylor (50) and Wolfenstein 
(59) - 

A review of the literature indicates that most 
of the multiple choice projective tests achieved 
little discrimination. In those cases where a fair 
amount of validity was reported, faults in the 
experimental design, such as improper cross 
validation, use of extreme groups, or testing 
of too few subjects, made the validity coeffi- 
cients suspect. On the other hand, none of the 
studies which found no discrimination was suffi- 
ciently comprehensive to warrant a negative con- 
clusion; namely, that the limitations inherent in 
any attempt to restrict responses to those listed 
on a test blank preclude valid discrimination 
with a projective test. Therefore, the present 
experiment was undertaken in an effort to 
determine the potentialities of the multiple 
choice projective test approach in the detection 
of psychopathology. 


A multiple choice projective test was 
constructed and administered under 
what were considered "optimum" con- 
ditions; ie, under those conditions 
which seem most likely to result in the 
detection of psychopathology. If, with 
such a test, a fair degree of validity were 
obtained, then further experimentation 
with this approach would seem indi- 
cated. On the other hand, if very little 
or no validity were obtained, then it 
would appear more profitable to explore 
other approaches—unless, of course, it 
was established that optimum conditions 
did not exist so that factors which might 
lead to radically different results were 
not included in the present experimental 
design. 


П. EXPERIMENTAL DESIGN 


WO AREAS where optimum condi- 
T must exist may be distin- 
guished: first, in the selection of the par- 
ticular psychopathology to be identiñed, 
and second, in the test methods used to 
identify this psychopathology. 


A. TYPE OF PsYCHOPATHOLOGY SELECTED 
FOR IDENTIFICATION 


In selecting the psychopathology to 
be identified, it is recognized that the 
grosser such psychopathology is, the 
more optimum conditions will be for 
valid identification. However, the gross- 
ness of the psychopathology must be 
limited so that useful conclusions may be 
drawn from the results. For example, the 
successful identification of deteriorated 
psychotics by a multiple choice projec- 
tive test would be of little value. A 
screening device is not needed for this 
purpose, and no inferences could be 
drawn as to the probabilities of finer and 
more useful psychopathological identi- 
fication. Therefore, the grossness of the 
psychopathology selected for identifica- 
tion in the present investigation was 
limited to such distinctions as would be 
required of a screening device. 

Since projective tests have been used 
most successfully in the detection of 
psychopathological diagnoses, the at- 
tempt was made in the present investiga- 
tion to differentiate among three diag- 
nostic categories: normals, anxiety reac- 
tions, and schizophrenics. These particu- 
lar categories were selected because they 
provided adequate distinctions and be- 
cause their frequency made available а 
sufficiently large number of subjects in 
cach category. A more detailed consid- 


"It was recognized that specific Situations in 
which a test would be required to discriminate 


eration of these diagnostic categories will 
be presented in section III. 


B. THE DEVELOPMENT OF A MULTIPLE- 
CHOICE PROJECTIVE T 


EST 


The review of previous research with 
multiple choice projective tests disclosed 
no test with sufficiently promising results 
to warrant further exploration of that 
particular instrument. Therefore, a new 
test was constructed, based on the results 
obtained with individual, group, and 
multiple choice projective tests, and up- 
on recommendations of various authori- 
ties in the field of projective testing. In 
this test, certain conditions were fixed 
since time considerations limited the 
length of the test. However, it was pos- 
sible to vary other factors within the test 


so that the effect of such variation could 
be estimated. 


1. Factors Not Varied 


a. Stimuli: The Rorschach inkblots 
were selected as the stimuli because the 


Rorschach appears to be the best projec- 
tive test for the prediction of psycho- 
pathology. 


According to Rapaport and Schafer, “The use 
of the Rorschach Test, as one in a battery of 
tests, in a clinical-psychiatric setting for pur- 
یک‎ 
only among subjects in 
classifications would r 
Purpose of the expe 
а test for practical us 
the multiple choice 
capable of valid psy 
tion. If potential y, 


these three diagnostic 
rarely arise. However, the 
riment was not to develop 
е, but to determine whether 
projective test approach is 
chopathological difjerentia- 
- alidity is established, then 
osea be made to develop a test for practical 
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poses of diagnoses and appraisal»of adjustment 
shows . . . . that it is beyond doubt the most 
potent single diagnostic instrument clinical psy- 
chology possesses" (42, p. 73). Brody, in review- 
ing the literature on mental testing from 1938 
to 1942, declared: "Increased use of mental tests 
for the study of psychoses and allied disorders 
is conspicuous in recent psychiatric practice. 
This increase is most prominent first in the use 
of the Rorschach ink blots" (3, p. 127), Symonds 
and Krugman point out the superiority of the 
Rorschach indirectly with their statement: "Next 
to the Rorschach, the Thematic Apperception 
Test has claimed most attention. Indeed, in 
humerous testing programs the Thematic Ap- 
perception Test and the Rorschach are listed as 
the two preferred methods for the study of 
personality. Actually, much yet remains to be 
done in exploration of the use of the Thematic 
Apperception method before it can receive gen- 
eral clinical use" (48, p. 88). Michael and Buh- 
ler (35) found the Rorschach to be supcrior as 
a diagnostic instrument to a variety of other 
tests both projective and non-projective. 
While in confirmatory experiments (2, 15, 
26, 33, 47, 58) poor results were obtained with 
the Harrower Multiple Choice Rorschach, which 
used the Rorschach inkblots as stimuli, such re- 
sults might well have been caused by inade- 
quacies of the specific test rather than by the 
nature of the stimuli. Hunt and Stevenson have 
remarked that ". . . . had Harrower-Erickson 
followed the usual painstaking steps of item 
analysis and test validation, relying more on 
empirical investigation and less on a priori 
judgment in selecting her alternative responses 
апа constructing her scoring key, the results 
might have been different" (24, p. 110). 


b. Method of administration: As the 
purpose of the present study was to con- 
struct and evaluate a test which could 
be rapidly administered, it was thought 
desirable to follow a group administra- 
tion procedure. ‘The Rorschach slides 
were used since various investigators (18, 
20, 22, 29, 30) have found that responses 
made to these slides are comparable to 
those made to the individual cards. 
These slides were projected on a screen 
in a semi-darkened room. An alternative 
Possibility, the use of booklets in which 
the Rorschach cards were printed, was 
eliminated because of the expense of 
printing such booklets, and because the 
time of stimulus presentation would 


have been difficult to control, particu- 
larly with the non-normal subjects. 

с. The use of multiple choice ques- 
tions: It is only with some form of a 
multiple choice question that the re- 
quirement of rapid objective scoring can 
be met. 

d. The empirical determination of the 
scoring weights to be given each те- 
sponse? Results obtained with the Har- 
rower Multiple Choice Rorschach by 
Due, Wright and Wright (9), Malamud 
and Malamud (31, 32); Lawshe and 
Forster (28), and Blair and Hochn (2) 
have clearly indicated that a reponse to 
a multiple choice Rorschach might not 
have the same psychopathological sig- 
nificance as the same response to the free 
association Rorschach. Therefore, the 
psyChopathological significance of multi- 
ple choice Rorschach responses must be 
determined independently. 


2. Factors Varied within the Test 


The factors which were varied are 
summarized below. They will be dis- 
cussed more fully in subsequent sections. 


a. Structure of the questions: 

(1). A “relative” versus an "absolute" deci- 
sion by the subject. In some questions the sub- 
ject was required to select one of four percepts 
(relative decision), while in other questions the 
subject was required to rate a single percept on a 
five point rating scale (absolute decision). 

(2). Differences in the types of alternatives 


? At this point, it would seem advisable to de- 
fine several terms which ordinarily may have 
different connotations. 

A percept is a possible description of the 
whole slide or some part of the slide. It may be 
specific, such as “A bat"; abstract, such as 
“Death”; or a denial, such as “Nothing at all.” 

An alternative is one of the multiple choices 
listed in a question. It may be a percept, as when 
the subject is required to select one of four per- 
cepts listed, or it may be a rating, as when the 
subject is required to select one of five ratings 
on a rating scale. 

A response is the answer a subject makes to a 
question. A subject's response can be either one 
of the alternatives listed in a question, or it may 
be a failure to select any alternative. 
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included in a question. The types of alternatives 
were varied to form the following question “de- 
signs." 

(a). Design I (forced "normal" choice) 
contained what were considered on a priori 
grounds to be two percepts characteristic of 
normals, one percept characteristic of anxiety 
reactions, and one percept characteristic of 
schizophrenics. 

(b) Design II (unlimited choice) con- 
tained one “normal” percept, one "anxiety reac- 
tion” percept, one “schizophrenic” percept, and 
one percept worded "Something else.” 

(с). Design ПІ (forced “abnormal” 
choice) contained two “anxiety reaction" per- 
cepts and two "schizophrenic" percepts. 

b. Time of stimulus presentation: As subjects’ 
immediate perceptions might differ from their 
more reflective perceptions, the time of stimulus 
presentation included both a brief exposure of 
the slides and an exposure which gave the sub- 
jects as much time as they required. 

c. Length of test: 'The total test battery was 
made long enough so that information might be 
obtained as to the effect of increasing the size 
of the test by comparing the results obtained 
from sections of the test with those obtained 
from the total test. 

d. Method of determining scoring weights for 
each response: While all methods depended upon 


?'TThe diagnostic significance of the various 
percepts had to be determined on an a priori 
basis because no empirically established norms 
were available. Furthermore, it was necessary 
to base these a priori judgments on the results 
obtained with the free association Rorschach, 
since so little was known about multiple choice 
Rorschach responses. 

The writer would like to express his apprecia- 
tion to Drs, Bruno Klopfer and Zygmunt А. Pio- 
trowski for their assistance in the selection of 
the percepts believed to be characteristic of the 
different diagnostic categories. The method by 
which the different percepts were chosen as 
characteristic of normals, anxiety reactions, and 
schizophrenics for the different inkblots is illus- 
trated below by the way in which the “normal” 
percepts were selected for the first Rorschach 
blot. 

The writer, after consultation with various 
experienced Rorschach examiners, drew up a list 
of about 15 percepts which were considered to 
be the most characteristic of normals. Drs. 
Klopfer and Piotrowski examined this list inde- 
pendently and each checked off the eight per- 
cepts they considered most characteristic. In 
some cases, they substituted new percepts as 
better choices than those listed. The writer used 
the selection of Drs. Klopfer and Piotrowski as 
a basis for choosing nine percepts for each card 
Then, when the structure of a question called 
for a “normal” percept, one of these nine per- 
cepts was used. Which particular one was selected 
was determined in a relatively random fashion, 
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the relative percentage of responses made by 
two groups, various formulae were employed 
which required varying amounts of difference 
between the two percentages for responses to be 
given scoring weights. 


3. The Final Test Battery 


Having determined the рѕусһора- 


thology to be identified (normal, anxiety 
reaction, and schizophrenia as discussed 
in section II, A) and the general composi- 
tion of the test to be used for this identi- 
fication, the next step was to construct 
a specific test. After experimentation 
with preliminary groups, the following 
test battery, termed the Singer Multiple 
Choice Rorschach (SMCR) to differenti- 
ate it from the Harrower Multiple 
Choice Rorschach, was developed. (A 
copy of the SMCR is shown in Appendix 
A.) The battery consisted of three tests. 

a. Test A: 
(1). Number of questions: 30 

_ (2). Number of percepts listed in а ques- 
tion: 4 

(8). Type of percepts in a question: There 
were 10 questions each of Designs I, II and Ш. 
(ле three questions for each slide included one 
of each design. The order in which the designs 
Were presented was varied systematically.) 

А (4). Response by the subject: The subject 
was required to select one of the four percepts 
in each question. 

(5). Time of stimulus presentation: A slide 
was shown for 10 seconds for each question. Sub- 


DE were then allowed a few seconds more in 
Which to record their response if they had not 
already done so. 
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were 10 questions for each slide? divided as fol- 
lows: 

(a). Four “normal” percepts. 

(b). Three “anxiety reaction” percepts. 

(с). Three “schizophrenic” percepts, one 
of which was “Nothing at all.” 
: (4). Response by the subject: Each ques- 
tion (percept) was rated by the subject on a 
five point absolute rating scale as follows: 

(a). Exactly like it; 

(b). Very much like it; 

(c). Somewhat like it; 

(d). Only a little like it; 

(c). Not at all like it. 

The term "like it" referred to how much 
either the whole slide or any part of it re- 
sembled the percepts listed. 

(5). Time of stimulus presentation: 

(а). Slide I. Three minutes. 
(b). Slide 11, Two and one-half minutes. 
(c). Slides III through X. Two minutes. 

The decreasing time limits were established 
because it was found that the subjects required 
less time to answer a question as they became 
familiar with the rating technique. Preliminary 
experimentation indicated that over 95% of the 
subjects finished within the above time limits 
when they were allowed unlimited time. 

For Slide I subjects were warned when they 
had one minute left, while for the other slides 
they were informed when there were thirty sec- 
onds remaining. 


The diflerences among the various 
tests may be observed more clearly in 


Table 1. The effect of these differences 
will be discussed in section IV. 


4. Administration of the Test 


The regular 314" by 414^ Rorschach 
slides, purchased from the Psychological 
Corporation, were projected on a screen 
by a lantern projector illuminated by a 
lamp of at least 500 watts. 


The screen image was generally at least 4' by 
5’ in size. In a few cases the image was slightly 
smaller because the size of the testing room 
made it impossible to focus a larger clear image. 
However, at such times the subjects were seated 
slightly nearer to the screen than in the cases 
where the 4’ by 5’ image was employed. It was 
possible, by having the light enter from the back 
of the room, to keep the image on the screen 
quite clear, while enough light existed for the 
subjects to read the alternatives on their test 
blaaks. The room was kept in this semi-dark con- 
dition throughout the test. Subjects were seated 
at least 12 feet from the screen and at an angle 
of less than 30? from the center of the screen. 


Before each test began, the subjects 
were given the answer sheets which listed 
the various possible responses for that 
test. They were told that slides would be 


TABLE 1 
CHARACTERISTICS OF THE DIFFERENT TESTS IN THE SMCR 


Differences in Types of Percepts* 
pest Ше ef Response Required Design I Design II Design HI — 
S Н a 
Ex DDsurë of Subjects No. | Percepts | No. | Percepts | No. | Percepts 
i of ina of ina of ina 
| Quest. Quest. | Quest. Quest. Quest. Quest. 
A ri Select 1 of 4 per- то | N-2 то | N-r то АК-2 30 
Brief cops AR-1 | AR-1 S-2 
5-1 5-1 | | 
| "зт | 
B | Relatively | Select 1 of 4 per- | 10 N-2 10 | № то АК-2 3o 
Unlimited | cepts RN | era 5-2 
| "Pu 
C | Relatively | Rate each percept | 4° | Na 39 | AR- | зо S 190 
Unlimited | on an absolute rat- | 
ing scale. | 
; ri basis to be characteristic of normals, AR for one 


* N stands for a percept considered on an a prio 
Characteristic of anxiety reactions, and S for one c 
the alternative “Something else.” 


haracteristic of schizophrenics. The ‘'?” stands for 
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projected on the screen, and that their 
task was to select a response while the 
slide was on the screen. No references 
were made to the Rorschach or to ink- 
blots. The specific procedure for each 
test was as follows: 


a. Test A. 'The first Rorschach slide was pro- 
jected on the screen for ten seconds, during 
which time the subjects checked one of the 
four alternatives in Question 1. The slide was 
taken off the screen and after a few seconds 
the same slide was again projected for 10 seconds 
while the subjects answered Question 2. After 
an interval of a few seconds, the slide was shown 
for another 10 seconds while the subject an- 
swered Question 3. The second Rorschach slide 
was then shown for 10 seconds with the sub- 
jects responding to Question 4. This procedure 


о 
continued, with a new slide being used for cach 
three questions, until all зо questions were 
answered. 

b. Test B. The first Rorschach slide was again 
projected on the screen while the subjects an- 
swered Questions тА, 2A and 3А. The slide 
remained on the screen for go seconds while 
all three questions were answered. Then the 
second Rorschach slide was shown for до seconds 
while the subjects responded to the next three 
questions, and so on until all 3o questions Were 
answered. 

c. Test C. The first Rorschach slide was shown 
on the screen for three minutes while the sub- 
jects rated all 10 percepts for that slide. The 
second slide was projected for two and a half 
minutes while the subjects rated the second 
10 percepts. Then the other slides were shown 
in order for two minutes each while the subjects 
rated the 10 percepts listed for each slide. 


À 


HI. DESCRIPTION OF THE POPULATION TESTED 


T" THE original experimental design, it 
was planned to establish the scoring 
weight for each response empirically 
from the results of 150 subjects, consist- 
ing of 5o normals, 50 anxiety reactions, 
and 50 schizophrenics, and to cross vali- 
date those weights on a second set of 
subjects. (The term "set" will always 
refer to a group consisting of 50 subjects 
in each of the three diagnostic cate- 
gories.) However, since it proved impos- 
sible to obtain subjects in the various 
diagnostic categories who were thor- 
oughly evaluated, and since the scoring 
weights established from only 50 subjects 
in each category proved highly unstable, 
a third set of subjects was tested to pro- 
vide a larger sample upon which to 
establish scoring weights. After the test- 
ing had been completed, it was found 
possible to add a fourth set consisting of 
an “excess” of properly qualified subjects 
who were not needed for the third set, 
and subjects who had been excluded 
from previous sets because they did not 
quite meet all the requirements. 

Over 2,000 subjects were tested, as it 
frequently was necessary, for administra- 
tive reasons, to include extraneous sub- 
jects in a testing group. The total num- 
ber of groups tested was 90, of which 39 
included only normals, 27 included only 
hospitalized schizophrenics, and the re- 
maining 24 contained both anxiety re- 
actions and out-patient schizophrenics. 
The results of 653 subjects, 250 normals, 
203 anxiety reactions, and 200 schizo- 
phrenics, were used. 


A. AGE AND EDUCATION 


Information was obtained as to the age and 
education of the subjects, and the three diag- 
nostic groups proved to be fairly homogencous 
in these respects. The mean age of the ‘normals 
was 25.0 years, that of the anxiety reactions was 


30.0 years, and of the schizophrenics 28.4 years. 
Standard Deviations were 5.0, 6.6 and 6.4 re- 
spectively. There was a great deal of overlap 
among the groups with 93% of the normals, 87%, 
of the anxiety reactions, and 87% of the schizo- 
phrenics falling between the ages of 20 and 35. 

The educational level of the subjects was de- 
termined by the highest grade completed in 
school. Information was lacking for 5 normals, 
6 anxiety reactions, and 21 schizophrenics, but 
there was no reason to believe that the educa- 
tional level of these subjects was much different 
from that of the others. The mean highest 
grades reached were 13.2 for the normals, 11.4 
for the anxiety reactions and 11.5 for the schizo- 
phrenics, with respective Standard Deviations 
of 2.0, 2.5 and 2.6. Seventy-eight per cent of 
the normals, 45%, of the anxiety reactions, and 
46% of the schizophrenics were graduated from 
high school. The percentages for those complet- 
ing the second year of high school were 96%, 
71% and 72% respectively. The few subjects who 
had, less than an eighth grade education (6%, 
of the anxiety reactions and 7% of the schizo- 
phrenics) were generally older or foreign born 
individuals. 


No attempt was made to equate edu- 
cational level, since the lower scholastic 
accomplishment is characteristic of the 
general anxiety reaction and schizo- 
phrenic population. It was felt that there 
was sufficient overlap among the three 
diagnostic groups to preclude any sub- 
stantial differences arising merely as a re- 
flection of variations in education. The 
possible effect of differences in age and 
amount of schooling has been evaluated 
in section V,B,8. 


B. DESCRIPTION OF SUBJECTS IN EACH 
DIAGNOSTIC CATEGORY 
The general characteristics of the sub- 
jects in each diagnostic category will be 
discussed first; then the composition of 
the four different sets will be presented. 


1. Normals 
Subjects in this group were drawn 
from veterans who came to the New 
York Regional Office of the Veterans Ad- 
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ministration for a wide variety of non- 
psychiatric reasons (e.g, to obtain in- 
formation relating to insurance, benefits 
under the G.I. Bill of Rights, medical 
or dental treatment, etc). All such vet- 
erans were first required to visit the Con- 
tact Division where they were inter- 
viewed and then routed to the appropri- 
ate section of the VA. Since the flow of 
veterans varied greatly during the weck, 
at peak periods it was necessary for them 
to wait an hour or more before being 
interviewed. At this time, an announce- 
ment was made over a loud speaker ask- 
ing male veterans of World War II to 
volunteer to take a personality test while 
they were waiting. АП such veterans who 
took the test were identified merely by a 
number which they placed on each page 
of their test blank. This was done to in- 
sure honesty of response as much as pos- 
sible. 

Subjects were selected as normals on 
the basis of their responses to two ques- 
tionnaires, one of the Cornell Selectee 
Index, Form N (53, 54. 55. 56), and the 
other a six-item questionnaire (listed in 
Appendix B) which was validated: 
against the Cornell Index, in an effort 
to include the best adjusted individuals 
in this group. 


2. Anxiety Reactions 


Subjects in this category were obtained 
from the VA Mental Hygiene Services of 
New York, Brooklyn and Philadelphia, 
and from the Rehabilitation Clinic of 
the Jewish Hospital of Brooklyn, the Mt. 
Sinai Hospital Veterans Clinic, and the 
Post Graduate Hospital Veterans Clinic. 
All subjects were out-patients and vet- 

*Only 10 of 233 subjects who answered all 
“No's” to this questionnaire, which was the 
normality criterion, scored over 14 on the Cornell 


Selectee Index, Form N, and only 34 of the 2 
scored over 10. iix 233 


SINGER 


crans of Woild War II. 

Originally, only patients diagnosed by 
a psychiatrist who had treated them for 
at least several months were accepted in 
this group. However, it became impossi- 
ble to obtain a sufficient number of sub- 
jects fulfilling this criterion. As the re- 
search progressed, there was a great de- 
crease in the number of veterans diag- 
nosed as anxiety reactions since, accord- 
ing to several psychiatrists consulted, 
those patients whose maladjustment 
arose primarily out of situational diffi- 
culties in the armed service or in becom- 
ing readjusted to civilian life either re- 
covered or "bound" their anxiety in а 
specific neurotic or psychotic reaction 
pattern. Many patients who retained 
their free floating anxiety would be 
diagnosed by a psychiatrist, after a few 
months of treatment, in terms of the 
dynamic factors underlying the anxiety 
rather than as anxiety reactions. 

Finally, for those patients still diag- 
nosed as anxiety reactions, interrogation 
of the psychiatrists making these diag- 
noses generally revealed that this cate- 
gory was being used as a “waste basket” 
and that many of the patients had either 
a mixed psychoneurosis, or were possible 
latent schizophrenics. 

_ Therefore, in order to secure the de 
sired number of subjects, it became 
necessary to accept a much less rigid cri- 
terion; namely, the diagnosis of a screen- 
ing psychiatrist who had interviewed 
the patient for about 45 minutes. It i5 
recognized that many of the patients 
diagnosed by a screening psychiatrist 
as anxiety reactions might, after more in- 
tensive diagnostic work, be classified 
differently, However, none of these sub- 
Jects may be considered normal in the 
sense of being well adjusted, relatively 
few could have been schizophrenic, and 
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all had at least a fair amount of free 
floating anxiety. 


3. Schizophrenics 

About half the subjects in this group 
came from the Veterans Hospital at 
Northport, while most of the others, 
some of whom were also veterans of 
World War II, came from Brooklyn 
State Hospital. A small number were 
being treated as out-patients in the New 
York and Brooklyn Mental Hygiene 
Services of the VA. 

For the most part, the subjects were in 
relatively good condition. Not only were 
the best patients selected from the two 
hospitals, but the patients had to be will- 
ing and able to fill out a fairly compli- 
cated test satisfactorily. ("Satisfactory" 
standards are described in the footnote 
in Section IV,B,2.) No coercion was used 
to make the subjects take the test, and 
even less aid was given them than under 
many group testing situations, since no 
proctor assistance was made available 
during the test. 

Although question concerning the orig- 
inal diagnosis of schizophrenia existed 
in less than a half dozen subjects, many 
of the patients were considered at the 
time of testing to be at or near remission. 
Almost half the Northport subjects were 
Open ward patients, and many were re- 
leased or sent out on trial visits less than 
two weeks after taking the test. Thus, 
subjects in this category represented the 
"best" schizophrenic population; that is, 
the type of subject who often might not 
be identified in a brief interview or from 
responses to a questionnaire.’ 


2 A survey of the 110 Northport subjects, made 
from one to two years after testing, revealed that 
33 had been discharged, 17 were out on trial 
Visit status, 2 were AWOL (absent without 
leave), 5 had been transferred to other hospitals, 
and 53 remained, Thus, about half no longer 
required hospitalization. Of those remaining, 


No attempt was made to use the schizo- 
phrenic  sub-classifications (catatonic, 
hebephrenic, simple, and paranoid) be- 
cause nearly all psychiatrists felt that the 
sub-classifications they gave to many of 
the subjects were unreliable. 


4. Special Characteristics of the Four Sets 
of Subjects 


a. Set. Cr. While each set of subjects 
was supposed to include go persons in 
each of the three diagnostic categories, 
there were actually 100 normals and 53 
anxiety reactions and 50 schizophrenics 
in this set because some of the subjects 
did not complete the entire battery. How- 
ever, the combined results of the subjects 
in each category was equivalent to results 
which would have been obtained had 
there been 50 subjects in each category 
completing the test battery. 


(1). Normals. These subjects were available 
for but half an hour. Therefore, they were 
screened with only the six-item questionnaire, 
and 5o were given Tests А and B while another 
50 were given Test C. 

(2). Anxiety Reactions. Most of these sub- 
jects were thoroughly evaluated by a psychiatrist. 
There were 53 subjects, 41 of whom took all 
three tests. One subject took only Test A, two 
took Tests А and B, two took Tests А and C, 
and seven took Tests B and C. These discrep- 
ancies occurred because some subjects did not 
answer all the tests satisfactorily, while others 
came in late and missed part or all of Test A. 
Results were available for 50 subjects on Tests 
B and C, but for only 46 subjects on Test A. 
Therefore, the frequencies in the intervals in the 
distributions of the last group were multiplied 
by 50/46 to obtain comparable distributions. 

(3). Schizophrenics. All 5o subjects took 
the complete test battery. Forty-two of them were 
patients at the Northport Veterans Hospital 
while the other eight were out-patients. 


b. Set C2. There were 50 subjects in 
each diagnostic category in this set and 
in each of the two following sets. АП 


many have been in good enough condition to be 
sent out on trial visit or leave of absence since 
their testing. 
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subjects completed the entire test bat- 
tery. 


(1). Normals. These subjects were first 
screened by the six-item questionnaire, and were 
given the Cornell Selectee Index, Form N, after 
completing the SMCR. Only subjects with a 
score of 10 or below on the Cornell Index were 
included in this set. 

(2). Anxiety Reactions. There were 29 sub- 
jects selected on the basis of a screening diag- 
nosis, The other 21 subjects were evaluated by 
the psychiatrist treating them. However, in many 
cases, the psychiatrist had not scen the patient 
long enough to formulate more than a tentative 
diagnostic impression. 

(8). Schizophrenics. 'There were 25 subjects 
from Northport Hospital, and 25 from Brooklyn 
State Hospital. 


с. Set Х. 


(1). Normals. These subjects were selected 
as in Set Ce. " 

(2). Anxiety Reactions. There were 4! mo 
jects selected on the basis of a screening CH Е, 
nosis, while the other 9 subjects меге BH а 
evaluation, in some cases a tentative one, эу 
the psychiatrist treating them. А И 

(3). Schizophrenics. This group inaug asp 
subjects from Northport, and 25 from Bro 
lyn State Hospital. 


d. Set C3. This set was composed of 
subjects who were in excess after the 
quota for Set X had been filled, plus 
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subjects who had not quite met the 
requirements for the previous sets. 


(1). Normals. There were 10 subjects who 
met the requirements outlined in Сә; 9 subjects 
who scored 10 or below on the Cornell Index 
but who had answered one "yes" on the six- 
item questionnaire; 28 subjects who had scored 
between 11 and 14 inclusive on the Cornell, 8 
of whom answered one "yes" and one of whom 
answered two "yes's" to the six-item question- 
naire; and three subjects who completed only 
half of the Cornell Index but whose score on this 
half indicated that they would probably have 
scored under 10 had they completed the test. 

(2). Anxiety Reactions. There were 33 "ex- 
cess” subjects, and 17 subjects who had a sup- 
plementary diagnosis, such as “and depression 
or “in an inadequate personality” which indi- 
cated that they were not relatively pure anxiety 
reactions. There were 18 in the former group and 
11 in the latter whose diagnosis was made by 
a screening psychiatrist, 

(3). Schizophrenics. This group included 
9 out-patients, 18 open ward patients from 
Northport Hospital and 23 patients in the 


process of insulin shock treatment at Brooklyn 
State. Hospital. 


Sets Ci, C2 and Cs were used as cri- 
terion populations upon whom scoring 
weights were established, while Set X 


served as the cross validation popula- 
tion. 


-— 


М 


IV. TREATMENT OF THE DATA 


A. DIAGNOSTIC GROUPS COMPARED 


ORDER to demonstrate that psycho- 

pathology can be detected by a multi- 
ple choice projective test, it would be 
necessary to discriminate successfully 
only between two of the three diagnostic 
groups tested. Therefore, in the treat- 
ment of the data, each pair of two diag- 
nostic categories was considered separ- 
ately. Thus, a comparison was made be- 
tween the normals and the anxiety 
reactions; another comparison was made 
between the normals and the schizo- 
phrenics; and a third comparison was 
made between the anxiety reactions and 
the schizophrenics. For convenient refer- 
ence, each set of two diagnostic groups 
was termed a "comparison pair," result- 
ing in: 

1. А normal-anxiety reaction compari- 
son pair (N-AR) 

2. A normal-schizophrenic comparison 
pair (N-S) 

3. An anxiety reaction-schizophrenic 
comparison pair (AR-S) 


Each of these comparison pairs was 
treated as a distinct unit, and the pro- 
cedure outlined below for the N-AR 
pair was duplicated for the other two. 


B. STATISTICAL PROCEDURE 
1. Method of Determining Scoring 
Weights 

The scoring weight given each res- 
ponse was determined by the relative 
percentage of subjects in each diagnostic 
group selecting that response. The pro- 
cedure followed is outlined below. 


4. The percentage of normals and anxiety 
reactions making a response was computed. 
Possible responses included the various alterna- 
tives listed in a question plus the failure to 
Select any alternative. 
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b. The significance of the difference between 
these percentages was then determined according 
to the formulae outlined in paragraph c. be- 
low. 

(1). If the normals’ percentage was signifi- 
cantly higher, the response was given a weight 
of minus 1. 

(2). If the anxiety reactions’ percentage was 
significantly higher, the response was given a 
weight of plus 1. 

(3). If the difference was not significant, the 
response was weighted o. 

(In the N-S comparison pair, a significantly 
higher normal percentage received a minus 1 
weight, while in the AR-S comparison pair, a 
significantly higher anxiety reaction percentage 
received a minus 1 weight.) 

c. Two different formulae were used to de- 
termine the significance of the difference between 
the percentages. 

^ (1). The first was a formula adapted from 
one outlined by Guilford! (14, pp. 298-301): 


P, — P. 
W=— 
bq 


where W = Weight. When the weight came to 
plus 0.5 or above, a weight of plus 1 was given. 
Similarly, a weight of minus 0.5 or below was 
weighted minus 1. 

P,=the percentage of subjects in group 1 
making a response, 

Р, = the percentage of subjects in group 2 
making a response. 


P, + P: 
p= сазы: 
ї=1ї=р 


(2). The second was the customary formula 
for finding the significance of a difference be- 
tween percentages, namely: 


"In Guilford's formula, a constant, plus 4, is 
added so that possible weights will all be posi- 
tive and range from o through 8. Guilford's 
formula was modified because the writer found 
it more convenient to work with plus and minus 
weights. Weights were restricted to plus or minus 
1 as the very few responses which would have 
received higher weights with the Guilford 
formula would have caused no appreciable dif- 
ference in the results. 
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where C.R. = the critical ratio, and the other 
symbols have their conventional meaning. 
With this formula, the determination of 
whether a difference is significant will depend 
upon the size of the critical ratio required. The 
higher the critical ratio required, the greater the 
difference between the two percentages will have 
to be. Four different critical ratios were used 
as criteria to determine weights as follows: 


(a) А C.R. of 1.6449, which is equivalent 
toa p of .10. 


(b) A C.R. of 1.9600, equivalent to a p 
of .05. 

(с) A С.К. of 2.3263, equivalent to a p 
of .o2. 

(d) A C.R. of 2.5758, equivalent to a p 
of .o1. 


d. Thus, there were five different weighting 

“systems” used: 

(1). That based on the formula of Guilford 
(referred to in future as weighting 
system C). 

Ihe four based on the regular per- 

centage-difference formula where the p 

required to indicate a significant differ- 

ence varied as follows: 

(a) p= ло (referred to as weighting 
system „ло). 

(b) p=.05 (referred to as weighting 
system .05). 

(с) p= оз (referred to as weighting 
system .02). 

(d) p= oí (referred to as weighting 
system .01). 


2, The Responses Weighted 


Since failure to respond to a stimulus 
is frequently a psychopathological sign 
indicative of "stimulus shock” or evasion 
on the individual Rorschach, such fail- 
ures were treated as regular responses. 
Thus, in Tests A and B where the sub- 
jects were requested to select one of four 
percepts, there were five possible res- 
ponses; four of them were the different 
percepts listed, while the fifth was the 
failure to select a percept. Similarly, in 
Test C, the five steps in the rating scale 
gave rise to five possible responses, while 
failure to sclect any rating provided a 
sixth possible response. The relatively 
few cases in which the responses were 
"spoiled" by the checking of more than 
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one percept’ or rating were scored as 
“failure to make a selection.” 

То make sure that failure to make а 
selection did not represent a general 
negativism toward the test situation, the 
results of the subjects who had selected 
too few of the alternatives listed were 
eliminated.? 


3. Criterion Populations 


Scoring weights were determined from 
several criterion populations as follows: 


a. Set Cr. Weights were established upon this 
population for the purpose of scoring the same 
population. It was recognized that such treat- 
ment would lead to a spuriously high differenti- 
ation since all "chance" differences were 
weighted in favor of greater differentiation. 
However, if little differentiation resulted from 
this treatment, then cross validation would be 
unnecessary, as no better results could be ех" 
pected. 

b. Sets Cr and C2 were combined to for" 
population C12. This was the criterion popula- 
tion upon which weights were developed for 
cross validation on Set X. 

c. Sets Cr, C2 and Сз were combined forming 
population C123. This was also used as а cr 
terion population upon which weights were de- 
veloped for cross validation on Set X. Population 
Cı23 was used to obtain information regarding 
the effect of: 

(1). Increasing 
population. 


Е мий 
the size of the criterio" 
critical ratio 
en a scoring 
iven 


(2). Increasing the size of the 
required for a response to be giv 
weight. There were too few responses g 
weights in criterion population C12 when a P ° 
зз Or з was used as a standard LO permit an 


: S. 
adequate judgment concerning these standard 


2 Except for a few anxiety reactions in Set Cr 
subjects were required to select specific s 
tives in at least 60% of the questions IR. 
test, and in 75% of the test battery as азу 
| of 653 gil eae 
subjects responded properly to less than 
the questions in di battery, only 13 respon B 


spon 
and only 29 Te! Pe hen- 


nade his 


selections capriciously, 
the response checks formed 
geometric pattern, no data 
were tabulated or scored. 


etic 0! 
pject 
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4. Weighting Systems Used With Different 
Criterion Populations 


а. With population Cı, only weighting system 
G was used. 
b. With population Cie, the weighting systems 
used were: 
(1). G 
(2). p = 10 
(3). p = -05 
c. With population Cie3, the weighting sys- 
tems used were: 


410 
.02 
EU 


5. Gross Validation 
АП cross validation was done on popu- 
lation X. The statistics used to evaluate 
the differences found between two dis- 
tributions were: 


а. The percentage of subjects identified in- 
correctly at the best critical score, Normal sub- 
jects scoring above the critical score were con- 
sidered incorrectly identified, as were anxiety 
reactions with scores below it. The critical score 
was placed at that point which would result in 
the minimum percentage of subjects incorrectly 
identified in the total population; that is, the 
minimum number obtained when the number 
of anxiety reactions incorrectly identified was 
added to the number of normals incorrectly 
identified. 


Two "best" critical scores were found empiri- 
cally, one based on the distributions of the 
criterion population, and the other based on the 
distributions of the cross validation population. 
The percentage of subjects incorrectly identified 
in the cross validation population at each of 
these critical scores was determined. It is recog- 
nized that the exact percentage of subjects in- 
correctly identified at a critical score will not 
be highly reliable. However, an indication of 
the possible variability can be obtained by not- 
ing the percentage of subjects incorrectly identi- 
fied at a point a few scores higher and a few 
scores lower than the critical score. 

Furthermore, the most desirable critical score 
would vary greatly with the purpose for which 
a screening measure was used. As Ellis and 
Conrad (10) have pointed out, in most practical 
situations the percentage of false negatives or 
the percentage of false positives, as the case may 
be, is of greater importance than a combined 
percentage of wrong identifications. Thus, a 
much higher critical score would be employed 
when the purpose of the test was to definitely 
eliminate anxiety reactions than when the test 
was used to select possible anxiety reactions for 
a more thorough evaluation. However, the pres- 
ent study is attempting to determine generally 
whether any psychopathological screening is pos- 
sible with a multiple choice projective test, and 
consequently it is concerned with the largest 
amount of differentiation possible, rather than 
with the direction of the error. 


b. The significance of the difference between 
the two distributions. This was determined as a 
critical ratio. 


V. RESULTS 


A. PRELIMINARY ANALYSIS UPON POPULATION Ci 


FTER population C1 had been tested, 
A a preliminary analysis of the data 
was made to determine whether further 
testing under the same experimental con- 
ditions was indicated. Scoring weights 
were developed from the responses made 
by the subjects in Cı using the Guilford 
formula (weighting system С). Then 
these weights were used to score the sub- 
jects in Ci. 

Since half of the "normal" subjects in 
Ci took only Test AB, while the other 
half took only Test C, it was not pos- 
sible to obtain a full battery score for 
these subjects. Therefore, the results of 
Test AB and Test C were treated ѕера- 
rately. 

A comparison of the Test AB scores 
received by the normals and the anxiety 
reactions revealed that only 995 were 
incorrectly identified at the best critical 
score. For Test C, only 6% were incor- 
rectly identified. In the normal-schizo- 
phrenic comparisons, only 8% were in- 
correctly identified with each of the two 
tests. An anxiety reaction-schizophrenic 
comparison was made only for Test AB 
where 12.5% of the subjects were incor- 
rectly identified. In all comparisons, the 
critical ratios were over 10. These results 
appeared definitely to warrant further 
testing with the SMCR. 

An analysis was made of the results 
obtained with Tests A, В and С sepa- 
rately, and with various sections within a 
test, to determine whether any modifica- 
tions should be made in the battery be- 
fore further testing was undertaken, 
However, the differences in test condi- 
tions which were investigated, such as in 
the time of stimulus exposure, appeared 
to produce little difference in the results 
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obtained so that no changes were made 
in the test battery, 


B. Cross VALIDATION or THE SMCR 


In the data presented below, unless 
otherwise specified, the results will al- 
Ways pertain to the subjects in popula- 
tion X; that is, the cross validation popu- 
lation, 


1. Discrimination Obtained with Dif- 
ferent Weighting Systems 


a. Percentage of subjects incorrectly 
identified. Table ° presents the scores 
obtained by normal and anxiety reac 
tions subjects when different sets of 
weights are used. 

In this table, as elsewhere, N терге 
sents the normal subjects and AR the 
anxiety reactions. Under "Score" are pre 
sented the lower scores in each step in- 
terval. To the right of each score аге the 
number of subjects found in that step 
interval. At the bottom of the page, “М” 
"SD," and "CR" have their conventional 
meanings. The "95 Wrong" stands for 
the percentage of subjects incorrectly 
identified at the best critical score. 

Tables 5 and 4, presenting the results 
of the normal-schizophrenic and anxiety 
reaction-schizophrenic comparison pairs, 
are similar in form to Table э. 

The amount of discrimination 0b 
tained for the three comparison pairs with 
different sets of weights is summarized 
in Tables 5A and 5B which present. the 
percentage of subjects incorrectly iden- 
tified (% Wrong) and the critical ratios 
respectively, 

As can be most easily observed from 
the summary tables, БА and 5B, the 
Various sets of weights produce a remark- 


ma 3%. 
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° TABLE > 


RESULTS OBTAINED WHEN THE NORMAL AND ANXIETY REACTION CROSS-VALIDATION 
POPULATIONS ARE SCORED WITH DIFFERENT SETS OF WEIGHTS 


Criterion Weights Based on C 123 
Weighting System Weighting System 
Score G ло -05 G ло E .o2 .or 
N AR N AR N AR N AR N AR N | АК N AR N |AR 
20 2 | 4 
16 1 2 3 3 1 3 
12 3 I 2 I z 3 2 4 I 
8 5 2 9 2 5 2 6 т 6 2 4 2 8 т $ 
4 2 s 8 I 16 2 13 3 7 s то 4 7 s š 
° 2 9 3 8 9 10 7 7 8 то тї 13 16 20 13 25 
=7 7 6 14 1 21 13 13 8 10 5 II 1 22 9 зо 10 
< 8 H1 14 6 17 Ж 14 6 5 5 15 6 4 I 2 
g 14 4 II 5 1 7 2 її 5 5 2 x 
=з 8 4 4 1 2 2 9 2 I 
=T7 4 2 1 
-21 3 1 I 
= I 
M* —9.4 |—2.3 —s.7 | 0.4 | 3.4 | 1.3 |—2.0 | 2.5 |—4.8 | 2.4 |—1.8] 2.6 |—o.s | 2.0 | —o.s | a.x 
SD* 7.8 8.6, 6.21 7.1 3.8 | 5.0 7.3 18.5 7-9 |10.1 5.8 | 7.0 3.8 | s.o 2.9 | 3.9 
CR* 4.3 4.5 4.1 3.4 3.9 3-4 2.7 3.8 
% Wrongt 28 29 31 зт зо 33 зо 31 


* These calculations were made from data grouped with a class interval of 2. Because such a table was too large for one 
printed page, the class intervals were enlarged to 4 in the above table. эр Ç . 

t These calculations were made from ungrouped data because in this statistic, unlike the others, grouping errors will oc- 
cur in only one direction; that is, to increase the % Wrong. 


TABLE 3 


RESULTS OBTAINED WHEN THE NORMAL AND SCHIZOPHRENIC CROSS-VALIDATION 
PoPULATIONS ARE SCORED WITH DIFFERENT SETS ОЕ WEIGHTS 


Criterion Weights Based on C 12 Weights Based on C 123 


Weighting System Weighting System 


Score G ло .05 G ло .05 .o2 .or 
NIs|NISINIS]N]|S[N]|S]RN]S]|msTS 


z 
u 


8 
- 
PRONNHOOR RE н юк 
ноло 000 Оо AM u H 
ноч о оон 
өөө 
MO de Com он е 
MUhROOOOR ww 
нн 


1 
© 
M"HMOOOtn RES 
HHOR DOOR DH 
но ORO 
RROD Оо ыл 
мни очил Ф ы 


M* —23.7| —2.2/ 725.0 —7.2| 21. 
SD* m 16.1| 12.1 13.4 7.6 
CR* 7.1 6.9 E 


%Wrongt | 20 22 18 18 21 20 19 22 


*8|—3.2| —14.4/—3.9 
+9! 10.5 8. 3: 
6.8 é d 


9 7.2 


+ 
* These calculations were made from data grouped with a class interval of 3. Because such a table was too large for one 


printed 3 lass intervals were enlarged to 6 іп the above table. — | = Š 
1 "The calawations were made from ungrouped data because in this statistic, unlike the others, grouping errors will occur 


in only one direction, that is. to increase the % Wrong. 


! 
| 
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TABLE 4 


RESULTS OBTAINED WHEN THE ANXIETY REACTION AND SCHIZOPHRENIC CRoss-VALIDATION 
POPULATIONS ARE SCORED WITH DIFFERENT SETS ОЕ WEIGHTS 


Criterion Weights Based on C r2 Weights Based on C хаз 
Weighting System Weighting System 
Score G ло +05 G ло +05 102 .or 
AR s AR s AR s AR s AR s AR s AR s AR s 
20 І | y | | 
16 3 3 
12 | 3 3 1 1 
8 2 5 1 s I I 5 4 1 
5 1 4 1 5 5| 3 3] s 7 3 3 1 š 
° 4 3 3 4 2 š I 2 I 3 3 6 6 14 4 17 
= 2 8 5 то 7 II y 12 4 8 4 10 4 13 15 15 
—$ 2 9 2 9 9 12 3 7 1 6 10]. 14 zt 17 so 
—9 7 5 8 6 6 5 8 6 E 9 9 9| 18 4| 13 3 
—13 6 3 12 2 17 5 6 2 8 7 II 2 8 4 
=e 6 I š 2 9 2| orn 2 7 *[ 43 5 
—21 10 3 6 3 1 8 3 9 1 4 1 
—3$ 3 1 4 2 I 7 4 
—29 5 1 5 
= 33 2 1 1 I 
N* —18.4|—2.7 —13.2/ —3.8| -11.0 —4.6 —12.3| —2.6| —17.3| —7.4| —12.7| — —8.1|—2.0| —5.7| 71:7 
SD* 0058] төй 8.01 te бо) Вр буйы <Š H 10.4) Geal T 3 6 s.s| 39 39 
CR* 5-3 4.7 4.3 4.8 4.8 5.3 5.1 5.1 
% Wrongt 24 24 27 27 24 26 25 31 


* These calculations were made from data grouped with a class interval 
printed page, the class intervals were enlarged to 4 in the above table. 
е 


These calculations were made from ungrouped data because in this statistic, unlike the others, grouping errors will occur 
in only one direction, that is, to increase the % Wrong. 


of 2. Because such a table was too large for one 


TABLE 5A 


SUMMARY OF PERCENTAGE ОЕ SUBJECTS INCORRECTLY IDENTIFIED (% WRONG) 
IN THE CROSS-VALIDATION SAMPLE 


Weights Based on 


Population C 12 Weights Based on Population C 123 Mean | Mean 
Comp. Pair Сха | C123 
G | ло | .05 G | ло | .05 | .02 | ‚ог 
N-AR 28 | 29 31 31 30 33 30 31 29 ET 
N-S 20 22 18 18 21 20 19 22 20 29 
AR-S 24 24 27 27 24 26 25 31 25 27 
Mean | 24 | 25 | 25 25 | 25 | 26 25 | 28 | 25 | 26 
ТАВГЕ $В 
CRITICAL Ratios FOR STATISTICS IN TABLE 5А 
| Population C 12 Population C 123 Mean | Mean 
Comp. Pair C 123 
В | G | ло | -05 G | ло | +05 | .02 | Cae 
N-AR EI | at | Bee | ga | ga J 22 | ok | za 2:4 
N-S 7.1 6.9 7-9 7.2 7-Х 7.0 6.8 б.т 7-3 б 
АК-5 Es | o4 | 4e | 8 | 48 | ка | s | sz | ak x 
Men | 5.6 | sa | sa | sa | ёз | s | 4:9 | se | ss | зт 
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ably similar amount of discrimination. 
For each comparison pair, the middle 
six (of eight) “% Wrong" figures have a 
range of only three percentage points, 
While the critical ratio range is less than 
one point. This consistency clearly indi- 
cates that differences both in the weight- 
ing Systems and in the criterion popula- 
tions upon which the weights are based 
have had little effect upon the amount of 
discrimination produced. 

b. Particular subjects incorrectly iden- 
tified. It is possible that while the same 
percentage of subjects was incorrectly 
identified by each set of weights, there 
may be differences with respect to the 
particular subjects who were incorrectly 
identified. Therefore, a comparison of 
the subjects incorrectly identified by each 
Set of weights was made. 

For each diagnostic group in a com- 
parison pair, there were eight distribu- 
tions, a different one produced by each 
set of weights. The 10 subjects scoring 
lowest or highest, whichever indicated 
incorrect identification, were selected 
from each distribution as the ones most 
Clearly misidentified. "Then, for each 
diagnostic group in a comparison pair, 
the 10 subjects appearing in most distri- 
butions as incorrectly identified were 
considered the criterion “incorrectly 
identified groups.” It was found that 
there was very little difference between 
the 10 subjects selected by any of the sets 
of weights and the subjects in the criter- 
ion group. ba dg 

Because of the marked similarity in 
the results obtained with the various 
sets of weights, it was felt justifiable to 
use only one set in the subsequent treat- 
ment of the data. The weighting system 
P = .10 with weights derived from popu- 
lation C123 was selected because it iden- 
tified so many of the subjects in the 


incorrectly identified group and because 
it led to the widest range in scores, an 
important factor when the results ob- 
tained from only sections of the test bat- 
tery were considered. 


2. Effect of Increasing the Number 
of Subjects 


The fact that adding the subjects in 
population Cg to the criterion popula- 
tion did not increase the validity may be 
attributed to one of two reasons. Either 
the 100 subjects in population C12 pro- 
vided a sufficiently large number to at- 
tain the best possible set of weights, or 
the poorer diagnostic classification of the 
subjects in Cg was enough to balance 
any additional validity to be gained from 
adding 50 subjects to each category in 
the criterion group. To investigate this 
problem, populations Cg and Сз were 
scored separately with weighting system 
P = ло based on criterion population 
C123. Population Ci was not scored be- 
cause a total score could not be obtained 
for the "normal" subjects since each of 
them had taken only half of the testing 
battery. 

In the N-AR comparison pair, the 
percentage of subjects incorrectly identi- 
fied was 18% for C2 and 16% for Сз. In 
the N-S comparison pair, these percent- 
ages were 16% and 17% respectively, 
while for the AR-S comparison pair 
they were 19% and 25%, respectively. 
While it is true that Cs and Cg each 
formed one-third of the criterion popula- 
tion upon which the weights were based, 
still if the diagnostic classification of the 
subjects in Cg was materially inferior, 
this population should have shown much 
less discrimination. Since the percentage 
of subjects incorrectly identified was very 
similar for both Сә and Сз, it would 
seem more logical to conclude that for 
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this test the diagnostic classification of 
subjects in Cg could not be much in- 
ferior to that of the subjects in C9. 
Hence, for the SMCR, the addition of 
subjects to the criterion population be- 
yond 100 per diagnostic category would 
not seem likely to materially increase 
the amount of discrimination produced. 


3. Reliability of the SMCR 


Because subjects were not available for 
retesting, the reliability was determined 
by a measure of internal consistency. 
The test battery was divided into four 
sections: Test A, Test B, the first half of 
Test C, and the second half of Test C. 
Each section was then split into two 
halves according to a table of random 
numbers. Scores on one half of each 
section were summed to give a Part 1 
score which was correlated against a 
Part 2 score derived from the sum of 
scores on the other half of each section. 

For each of the three comparison pairs, 
reliabilities were found for each diagnos- 
tic group separately, and then for both 
groups combined. In the N-AR com- 
parison, the reliability coefficients, when 
corrected by the Spearman-Brown proph- 
есу formula, were .69 + .07 for the nor- 
mals alone, .75 2-.06 for the anxiety 
reactions alone, and .77 = .o4 when both 
groups were combined. In the N-S com- 
parison, the corrected coefficients were 
‚78 + .05 (№), .82 + .05 (S), and .87 + 
og (N-S); while those for the AR-S 
comparison were .65 + .08 (AR), .67 = 
8 (S) and .72-.05 (AR-S). These 
results would indicate that the SMCR 
has a reasonable degree of reliability. 


4. Validity of the SMCR 


There would seem to be no question 
that valid group differentiations have 
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been made ‘for each comparison pair. 
In Table 5B only one of the critical 
ratios was under 3.0, and that came to 
2.7. However, the amount of error when 
individual prediction is desired can be 
gathered only from the percentage of 
subjects incorrectly identified. In Table 
5A, when weighting systems p=.o1 was 
eliminated because it was the only one 
which gave somewhat atypical results, 


the median of the other seven weighting. 


sets indicate that 30% of the subjects in 
the N-AR comparison pair are incor- 
rectly identified, as are 20% in the N-S 
comparison pair and 25% in the AR-S 
comparison pair. 

However, it must be recognized that 
these percentages are based on the "best" 
critical score of the cross validation dis- 
tributions. This procedure weights all 
sampling errors in favor of a greater 
discrimination, and thus gives a spuri- 
ously high validity. To properly estimate 
validity, the critical score should be 
based on a previous population. There- 
fore, the best critical score was found 
from the disuibutions of population C23 
scored with weighting system p=-10 
based on criterion population C123- 
These critical scores proved to be +2 for 
the N-AR comparison pair, —21 for 
the N-S comparison pair, and either 
—7 or —8 for the AR-S comparison 
pair. There were two possible critical 
scores for the last pair because both gave 


the same percentage of subjects incor , 


rectly identified. 

At the above cutting scores there 
would be 30% of the N-AR and 25% 
of the N-S comparison pairs incorrectly 
identified. Since a cutting score of —8 
would result in 31% of the subjects being 
incorrectly identified, while a score of 
=| would result in 29% of the subjects 
being incorrectly identified, it would ap- 
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pear best to estimate the percentage of 
subjects incorrectly identified in the 
AR-S comparison pair at 30%. 

It should be recognized that using the 
results of only one rather than a series of 
samples to establish the critical scores, 
and having too few subjects in the distri- 
butions for smooth curves may lead to 
an underestimate of the validity. There- 
fore, the best possible estimate of the 
validity may lie somewhere between the 
above results and those obtained when 
using the best critical score of the cross 
validation distribution. However, exactly 
where in between these points it would 
lie is impossible to determine, and it 
would appear safest to conclude that the 
SMCR will incorrectly identify at most 
30% of the N-AR comparison pair, 
2595 of the N-S comparison pair, and 
30% of the AR-S comparison pair. 


5. Classifying Subjects According to 
Tri-Scores 

In the above estimate of validity, the 
subjects in a comparison pair were scored 
only with the set of weights derived from 
the distributions of that comparison 
pair. However, these subjects could also 
be scored with the weights derived from 
the other two comparison pairs. Thus, 
it was possible to score each subject with 
three sets of weights, one from each com- 
parison pair, and thereby classify him: 

1. As normal (N) or as anxiety reac 
tion (AR) from the results obtained with 
the N-AR comparison pair scoring key. 

2. As N or S from the N—S compari- 
son pair scoring key. 

3. As AR. or S from the AR-S com- 
parison pair.scoring key. А 

Each subject could then be classified 
in one of the following eight Ways: 
(These classification combinations will 
be referred to in the future as tri-scores.) 


1. As N on the N-AR key; N on the N-S 


key; and AR on the AR-S key. NNA 
2. N (N-AR key); N (N-S key); S (AR-S 

key). NNS 
3. М (N-AR key; S (N-S key) AR 

(AR-S key). NSA 
4. N (N-AR key); S (N-S key); S (AR-S 

key). NSS 
5. AR (N-AR key); N (N-S key); AR 

(AR-S key). ANA 
6. AR (N-AR key; N (N-S key); S 

(AR-S key). ANS 
7. АК (N-AR key); S (N-S key); AR 

(AR-S key). ASA 
8. AR (N-AR key); S (NS key; S ` 

(AR-S key). ASS 


'To determine the validity obtained by 
using these tri-scores, the same validation 
procedure was followed as when subjects 
in a comparison pair were scored only 
with the set of weights derived from that 
comparison pair. 

Population Ce was used as the criter- 
ion population, and the tri-scores of this 
group were found. The diagnostic iden- 
tification of these subjects was based on 
whether they scored above or below a 
critical score. For each comparison pair 
the critical score was determined from 
the distributions of the two diagnostic 
groups within that comparison pair. (In 
all comparison pairs, the point at which 
the critical score fell proved to be the 
same for population C2 as had been 
found for population C23.) 

For the cross validation population, 
two separate tri-score distributions were 
obtained; one where the diagnostic clas- 
sification of the subjects was based on a 
critical score derived from the distribu- 
tion of the cross validation population 
itself (Pop. X), and the other where the 
diagnostic classification was based on the 
point selected as the critical score in 
population Сэ. (Pop. X^). The tri-score 
distributions are presented in Table 6. 

The tri-score weights were determined 
for each comparison pair by the tri-score 
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TABLE 6 
NUMBER or SUBJECTS IN EACH TRI-SconE CLASSIFICATION 


Population C2 
(Classified according to 


Population X 
(Classified according to 


| Population x’ 
(Classified according to 


Tri-Score own "best" critical score) | own "best" critical score) C2 critical score) 

N an | s | N |aR| s | w | aR | s 
NNA | 39 | тг Š | 2 | wm | 3 32 | 1 | 8 
NNS | o | I ° I 1 ° | >” | ° | 1 
NSA | E || ї | + S | 4 | 6 | * | 6 
NSS | 6 | o 14 | 6 o 20 | £ || ° I2 
ANA I 8 ° $ 6 ° | 4 | 9 | 3 
ANS o | o f | o | 1 ° ° | * | o 
ASA | o | zo $ | s] sx 3 | 3 12 7 
ASS | E 4 8 25 5 12 20 | 3 | 8 | 13 


distributions of the two diagnostic 
groups in a comparison pair. For ex- 
ample, in the N-AR comparison pair, 
the tri-score ММА was considered as а 
normal score since the population Ce 
section of Table 8 reveals that 89 mor- 
mals and only 12 anxiety reactions made 
this tri-score. For two tri-scores, NSA and 
ANS, there were the same number of 
normals and anxiety reactions making 
these scores. In such cases, the tri-score 
was weighted according to the results of 
the comparison pair involving the two 
groups concerned. Thus, in the N-AR 
comparison, the results obtained with 
the N-AR set of weights would deter- 
mine whether a subject would be classi- 
fied as normal or as anxiety reaction. 

It can be seen in Table 7 that for both 
group X and X', the % Wrong using 


only the critical score in a subject's com- 
parison pair gives about the same results 
as the % Wrong using the tri-scorc, The 
very limited improvement which results 
when the best possible combination of 
tri-scores is found for each group indi- 
cates that little could be expected from 
using a criterion population larger than 
Сз in establishing the tri-score weights. 

However, the tri-score is not entirely 
without advantage. Thus, for subjects 
receiving the tri-score NNA, a statement 
could be made that they are most likely 
to be normals; but that if they are not 
normals, then they are more likely to be 
anxiety reactions rather than schizo- 
phrenics. There are a sufficient number 
of subjects with tri-scores NNS, ANA 
ASA, and ASS to make similar interpre 
tations possible for subjects in these clas- 


TABLE ; 


PERCENTAGE OF WRONG DISCRIMINATIONS OBTAINED WITH CRITICAL Scores 
ONLY, AND WITH TRI-SconEs 


Population C2 


Population X Population X^ 


м [aR] s |N AR|s|N]|an| 8 

With critical score only | 18 16 19 30 21 24 30 25 3£ 

At best tri-score combination 17 15 | 7 29 20 22 30 25 27 
At tri-score combination using | | | | 

C2 tri-score wts. | | | 3o 21 | 24 | 3 25 | 29 
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Sifications. Of the 134 subj&ts in popu- 
lation X’ receiving these tri-scores, а pre- 
diction as to which of the three diagnostic 
groups a person is not in would be incor- 
rect in only 17, or 13%, of the cases. Had 
the diagnostic prediction of the 16 sub- 
jects not receiving any of the above tri- 
scores been made on a chance basis, the 
percentage incorrectly identified would 
still be under 17%. 


6. Comparison of A Priori Judgments 
as to Diagnostic Significance of Res- 
ponses with Empirical Results 


The a priori judgments were com- 
pared to the empirical results in regard 
to the diagnostic implications (normal, 
anxiety reaction or schizophrenic) of the 
various percepts. The results of this com- 
parison are presented in Table 8. Listed 
at the left are the diagnostic categoriza- 
tions which were determined empirically. 
These categorizations were based on the 
results obtained with the entire experi- 
mental population. The letter “O” indi- 
cates those percepts which did not re- 
ceive any weights (using the p = .10 
weighting system), and hence could not 
be considered characteristic of either 
diagnostic group. | 

The a priori judgments as to diagnos- 
tic implication are presented along the 
top of the table. The “?” stands for the 
alternative "something else," while the 
letter “F” is the response “failure to s€- 
lect one of the alternatives in a question.” 
Separate results were computed for Test 
AB and Test С. In the former, it was 
possible to determine the empirical 
Weight for each percept exactly. In the 
latter, however, this determination had 
to be made qualitatively. 

In Test C, the subjects checked one 
of five possible ratings for each percept, 


and a separate weight was found for 
cach of these five ratings. For some per- 
cepts, the diagnostic implication was 
quite clear since the first few rating steps 
were characteristic of one diagnostic 
category, while the last few were charac- 
teristic of the other diagnostic group. 
However, the picture generally was more 
confused. For example, the first and 
fifth steps might be given a normal 
weight, the fourth step an anxiety reac- 
tion weight, and the second and third 
steps given no weight. With results such 
as these, it was impossible to define the 
percept as characteristic of normals or of 
anxiety reactions. In many cases, border- 
line decisions had to be made, such as 
when of the five rating steps only the 
first or second step was found to be 
weighted. Included in the present table 
were only those percepts whose diagnos- 
tic implications seemed fairly clear. 

Table 8 reveals that for the N-AR 
comparison pair there is no correlation 
between a priori judgment and the em- 
pirical results. 

Of the бо percepts considered to be 
characteristic of normals on Test AB, 
only g proved to be so empirically, and 
in four cases the supposed normal per- 
cepts were found to be characteristic of 
anxiety reactions. Of the 80 percepts 
judged to be characteristic of anxiety 
reactions, only 6 were so empirically, 
while 13 were found to be characteristic 
of normals. Thus, in only 15 of the 140 
percepts did the a priori judgment coin- 
cide with the empirical evaluation, while 
in 17 cases they were exactly the op- 

1It should be clearly recognized that the au- 
thorities who estimated the psychopathological 
significance of the responses did so on the basis 
of their experience with the individual Ror- 
schach, and they made no claims that the sig- 


nificance would remain the same on a multiple 
choice test. 
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posite. Very similar results were also ob- 
tained on Test C. 

For the other two comparison pairs, 
there appears to be a positive correlation 
between a priori judgment and empiri- 
cal evaluation. In the N-S comparison, 
of the бо percepts presumed to be nor- 
mal on Test AB, 16 actually were so with 
only 5 being exactly the opposite; that 
is, being characteristic of schizophrenics. 
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29 percepts хоіпсійе, and 25 are exactly 
the opposite. 

It is difficult to use a single statistic to 
indicate the amount of correspondence 
because a lack of coincidence between 
the a priori judgment and the empirical 
results may be expressed in two ways: 
the estimated percept may prove to be 
significant of neither diagnostic category; 
or, what appears to be more serious, it 


TABLE 8 
CORRESPONDENCE BETWEEN EMPIRICAL AND А PRIORI CATEGORIZATIONS 
A Priori Categorization 
Comparison Empirical 
Lair Categorization Test AB Test C 
NlaA|s ? К | | А 5 Е 
ы 
Normal W 9 13 6 2 ° I4 I2 2 ° 
N-AR Anxiety Reaction 4 6 19 4 I 4 5 16 5 
Neither 47 61 55 14 59 
T 
Normal В 16 19 6 4 ° 17 8 6 o 
N-S Schizophrenic 5 12 29 5 35 13 9 14 67 
Neither 39 49 45 II 25 
- me 
Anxiety Reaction I4 15 10 2 o 18 19 I o 
AR-S Schizophrenic 3 15 14 I 26 12 14 23 45 
Neither 43 50 56 17 34 


Of the 8o percepts estimated as schizo- 
phrenic, 29 were found to be so empiri- 
cally with only 6 having exactly the op- 
posite diagnostic significance. Thus, in 
45 of the 140 percepts the a priori judg- 
ments and the empirical results coin- 
cided, while they were exactly the oppo- 
site for only 11 percepts. The results of 
Test C, though, are not quite as good, 
with only 31 of the percepts coinciding 
while 19 were exactly the opposite. 

In the AR-S comparison it is Test C 
which shows the greater amount of cor- 
respondence between a priori judgment 
and empirical evaluation. Here 42 of the 
percepts coincide, with only 15 being 
exactly the opposite. However, little 
relationship is shown in Test AB where 


may turn out to be significant of exactly 
the opposite diagnostic category tham 
was expected. 

In general, it may be said that a posi- 
tive correlation exists between a prior! 
judgments and empirical results since 
these two coincide more frequently, on 
the whole, than they are exactly the op- 
posite. However, this correlation seems 
to be fairly low as can be seen from the 
relatively few times a percept is found 
empirically to have exactly the same 
significance as was expected on an а 
priori basis. Thus, while a priori judg: 
ments may be of value in the initial se 
lection of items for a multiple choice 
projective test, there can be no question 
but that it will be essential that the 
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psychopathological significance of cach 
percept be determined empirically. 
However, the data in Table 8 also in- 
dicate that a detailed analysis of those 
percepts found to be significant of a diag- 
nostic category may uncover general 
principles applying to multiple choice 
projective tests which could materially 
increase the validity of a priori judg- 
ments. For example, in the N-AR com- 
parison, the a priori estimations would 
have been much more accurate had those 
percepts which were considered to be 
characteristic of schizophrenics been 
judged to be indicative of anxiety reac- 
tions, while those percepts predicted as 
typical of anxiety reactions were instead 
considered characteristic of normals. 
This holds true for both Test AB and 
Test C. Similarly, in the AR-S com- 
parison, the  percepts which were 
thought to be indicative of anxiety reac- 
tions proved to be much less discrimi- 
nating than the percepts which has been 
assumed to be characteristic of normals. 
Also apparent from Table 8 is the fact 
that the alternative. "Something else" 
(?) does not appear to differentiate any 


diagnostic group, while the "Failure to 
select an alternative" (F) seems to be 
primarily a schizophrenic response. 


7. Comparison of Conditions Varied 
within the Test Battery 


As outlined previously (Section ILB, 
2), tne experimental design included 
variations in certain of the test condi- 
tions. By comparing the results of differ- 
ent sections of the test battery, the effect 
of these variations could be determined. 
The relative discrimination obtained 
under the different test conditions is 
shown in Table 9. While the amount of 
discrimination under each condition was 
measured by both the critical ratio and 
the % Wrong (percentage of subjects in- 
correctly identified), only the critical 
ratios are listed because of the extremely 
high correlation between these two sta- 
tistics. Preference was given to the criti- 
cal ratio because it would seem to be the 
more reliable statistic. 

a. Differences in the structure of the 
questions. 

(1). Relative vs. absolute decision 
by the subject. In Test A and Test B, 


TABLE 9 
COMPARISON OF CONDITIONS VARIED WITHIN THE TEST BATTERY 


Critical Ratios for Different 
"m абсо Comparison Pairs 

N-AR N-S AR-S 
Relative vs. a te decision | Test AB (relative) 3-4 5.5 4.0 
by pen аро Test C (absolute) 3.0 7.8 4-1 
Diff. i [ percepts | Design I (forced normal) 9.7 4-1 2.7 
[елее pre of percepts | Design II (unlimited) 3.8 uy 2.8 
Design III (forced non-normal) 1.6 4.6 3.0 
Briet эв. xposure of | Test A (brief) 2.6 4.1 3.5 
ш y d Test B (lengthy) Ма 5:9 2m 
No. ; i test | Designs I, П, III (20 questions) 2.0 4.5 2.8 
Ж in the test | ТОРА B (30 questions) “Фф 59 3.9 
Test AB (6o questions) 3:4 5.5 4.0 
Test C (100 questions) 3.0 7.8 4.1 
Test ABC (160 questions) 3-9 7:1 4.8 
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which have been combined to form Test 
AB, the subjects were required to select 
one of four alternatives, thus making a 
relative decision among these alterna- 
tives. On Test C, the subjects rated each 
alternative on a five point absolute rat- 
ing scale. As can be seen from the results 
in Table 9, the type of decision made by 
the subjects appears to have little effect 
upon the amount of discrimination. 
While Test AB is somewhat superior for 
the N-AR comparison pair, Test C has 
a higher critical ratio for the N-S com- 
parison pair, while the results are prac- 
tically identical in the AR-S compari- 
son pair. 

(2). Differences in the types of alter- 
natives included in a question. Results 
were obtained for three different ques- 
tion "Designs," each of which was struc- 
tured to include only certain types of 
percepts (section ILB,2a [2]. It was 
necessary to select percepts on the basis of 
an a priori judgment as to their psycho- 
pathological significance, and, as has 
been shown, such a priori judgments do 
not correspond to the psychopathologi- 
cal significance found empirically. 
Hence, the original rationale for this 
comparison no longer exists. However, 
there remains a comparison between 
questions in which the subjects are re- 
quired to select a specific percept (Designs 
I and III), and questions in which the 
subject may avoid selecting a specific 
percept by checking the alternative, 
"Something else," (Design П). 

From the results it is seen that Design 
II was superior for the N-AR compari- 
son pair, while Design III was best for 
the N-S comparison pair. Little differ- 
ence was found among the designs in 
the AR-S comparison pair. While no 
superiority seems to be shown for either 
the “forced-choice” or the "unlimited- 


choice" question designs, any conclu- 
sions should be made cautiously since 
these results were based on only 20 ques- 
tions of each design. 

b. Differences between a brief and 
lengthy exposure of the stimulus. In 
Test A the stimuli were exposed for а 
relatively brief period. In Test B, which 
was equivalent to Test A in other re 
Spects, the subjects were given as much 
time as they desired to view the stimulus. 
As shown in Table 9, Test B produced 
greater discrimination in the N-AR 
and N-S comparison pairs, while ‘best 
A was superior in the AR-S compari- 
son pair. These results do not appear to 
be definitive enough to warrant drawing 
any general conclusions. It remains po* 
sible that the differences found represent 
true differences, and that opposite results 
are obtained in different comparison 
pairs because "optimum" conditions are 
different for each pair of diagnostic 
groups. 

с. The effect of increasing the length 
of the test. By considering different 5€ 
tions of the test battery separately, it waš 
possible to determine empirically the 
effect of increasing the length of the test. 
Designs I, II and III each included only 
20 questions, so the mean of the critica 
ratios obtained with these “Designs” W^? 
used to estimate the discrimination prO 
duced by 20 questions. The amount © 
discrimination resulting from 30 que 
tions was determined by the mean of ше 
critical ratios obtained with Test А an 
Test B, each of which contained that 
number of questions. When Tests ^ 2? 
В were combined to form Test AB, thet 
Was a total of Go questions. In Test 
there were 100 questions, while the com 
plete battery, Test ABC, contained qne 
questions. The fact that the types ? 
questions were different for different test 
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lengths was not considered? contaminat- 
ing since it would appear from the pre- 
ceding results that the type of question 
has little effect upon the amount of dis- 
crimination. 

As can be seen in Table 9, there is, in 
general, a steady increase in discrimina- 
tion for each comparison pair as the 


Wrong of the three comparison pairs 
drops from 35% for the 20 question sec- 
tion to 25% for the 160 question battery. 


8. The Effect of Age and Education 


Since there were some differences 
among the three diagnostic groups with 
respect to age and amount of education, 


TABLE ro 


COMPARISON OF SUBJECTS INCORRECTLY Ip 


STIFIED AND SUBJECTS CORRECTLY IDENTIFIED, 


WITH RESPECT TO AGE AND AMOUNT OF EDUCATION 


" i Age Comparison Education Comparisor 
Diagnostic Compar. Group Age Compari panaan 
Category Pair N Mn | Mdn | N* Mn | Mdn 
N-AR Incorrect 10 25.2 | 25.2 10 11.7 | 12.5 

Correct | 40 25.8 | 24-3 до 12.5 | 2.7 

Normals N-S Incorrect 10 25.5 23.5 | 10 11.6 11.7 
Соггесї 40 25.7 25.I 40 12.5 | 12.8 

Both Pop. X 50 25.6 24.7 50 12.3 12.7 

Total Pop. 250 25.0 23.8 245 i32 12.8 

N-AR Incorrect 10 33.3 33.0 10 12.3 13.0 

Correct 40 30.1 29.5 40 10.6 | 10.6 

Anxiety AR-S Incorrect 10 35.1 36.0 10 8.8 8.8 
Reactions Correct 40 29.6 20.8 | 40 $i 12.1 
Both Pop. X | so 30.7 30.5 50 10.9 10.9 

Total Pop. 203 30.0 29.6 197 | IT.4 | 11.5 

N-S Incorrect 10 27.3 | 25.5 8 $4.5. 12.0 

Correct 40 29.2 28.5 34. її.О IIr.6 

Schizophrenics | AR-S Incorrect то | 26.6 25.5 9° тт.8 тї 
Correct 40 29.4 28.8 33 ix. 11.6 

Both Pop. X 50 28.8 28.0 42 II.3 11.6 

Total Pop. 200 28.4 27.3 179 Iba 11.0 


* For some subjects information as to the amount of education was not available. 


length of the test increases. Were the 
critical ratios of the three comparison 
pairs to be averaged, then the few 
fluctuations would be eliminated, and 
the amount of discrimination. would 
show a regular progression. The impor- 
tance of the length of the test upon indi- 
vidual prediction can be gathered from 
а comparison of the percentage of sub- 
jects incorrectly identified. The mean % 


it was possible that the discrimination 
obtained between diagnostic groups was 
a reflection, in part at least, of these age 
and educational differences. In Table 
10, the correctly and incorrectly identi- 
fied subjects (defined in section V,B,1,b) 
in each comparison pair are differenti- 
ated with respect to age and amount of 
education. For purposes of comparison, 
data are also presented for all of popula- 
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tion X, which includes both correctly 
and incorrectly identified subjects, and 
for the entire experimental population. 

Each diagnostic group is included in 
two comparison pairs. The results of the 
normals and schizophrenics are very 
similar in both comparison pairs because 
in each case many of the same subjects 
were incorrectly identified in both com- 
parison pairs. Six of the normals who 
were incorrectly identified in the 
N-AR comparison pair were also in- 
correctly identified in the N-S compari- 
son pair. With the schizophrenics, there 
were nine subjects duplicated. However, 
none of the anxiety reactions was incor- 
rectly identified in both comparison 
pairs. (A chance matching of 10 names in 
each diagnostic category would have ,re- 
sulted in two subjects being included in 
both comparison pairs.) 

There seems to be no difference in age 
between correctly and incorrectly iden- 
tified normals. However, the incorrectly 
identified schizophrenics are a few years 
younger than those correctly identified, 
while the incorrectly identified anxiety 
reactions, particularly in the AR-S com- 
parison pair, are an older group. 

The incorrectly identified normals 
have slightly less education, while the in- 
correctly identified schizophrenics are 
slightly better educated. With sot ме 
iety reactions, however, the educationa 
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differences afe marked. Those incorrectly 
identified as normals have about two 
more years of education than those cor- 
rectly identified, while the anxiety reac- 
tions identified as schizophrenics have 
about three years less education. 

From these results it would appear 
that, particularly in the case of the anxt- 
ety reactions, the subjects' responses are 
a product of their educational back- 
ground as well as of their psychopatholog- 
ical condition, The educational back- 
ground may, in turn, bea reflection of 
intellectual endowment. The entire ques- 
tion appears to be too complex to be 
resolved in the present study, particu- 
larly since the inadequacy of the original 
diagnostic classification may also be a 
factor. The chief screening psychiatrist, 
in charge of the panel which diagnosed 
almost all the anxiety reactions in the 
cross validation population, stated that 
most diagnostic errors probably occurred 
when attempting to determine whether 
the older patients with little education 
(i.e., those anxiety reactions identified on 
the SMCR as schizophrenics) were anxi- 
ety reactions or schizophrenics. In any 
case, the overlap between correctly and 
incorrectly identified subjects with re- 
spect to age and education was so great 
that differences in these factors cannot 
be responsible for most of the discrimina- 
tion obtained with the SMCR. 
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VI. DISCUSSION OF THE RESULTS 


ү cross validation, the SMCR in- 
correctly identified at most 30% of 
the N-AR comparison pair, 25% of the 
N-S comparison pair, and 30% of the 
AR-S comparison pair. For all com- 
parison pairs the critical ratios were 
sufficiently high to establish a significant 
difference between diagnostic groups. By 
using tri-scores, the diagnostic category 
a subject was not in could be predicted 
with an error of under 17%. Therefore, 
it would appear that a fair validity had 
been obtained with a multiple choice 
projective test. Since the amount of dis- 
crimination found compares favorably 
with the validity reported for other 
Screening measures, and since there are 
indications that subsequent research 
with multiple choice projective tests will 
lead to even greater discrimination, 
further investigation with this approach 
would seem warranted. 


A. COMPARISON WITH OTHER SCREENING 
DEVICES 

As mentioned previously, screening 
tests have been found to be most effective 
in military screening. Ellis and Conrad 
(10) have presented a detailed summary 
of the results obtained with personality 
inventories in this area. For most stud- 
ies, the amount of validity reported is 
approximately the same as that obtained 
with the SMCR. While there were a few 
reports of unusually high validity, Ellis 
and Conrad suggest that such validity 
coefficients reflect the operation of un- 
usual experimental or statistical factors. 

The possible effect of improper cross 
validation, an experimental error found 
in many studies, has been well illustrated 
by data obtained in the present investi- 
gation. When a population of 5o sub- 
jects in each diagnostic group (C1) was 
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scored with weights derived from the 
same population, the scores from just 
half the test battery resulted in only 
about 8% of the subjects being incor- 
rectly identified. When weights derived 
from a criterion population of 150 sub- 
jects in each diagnostic category (C123) 
were used to score one-third of these sub- 
jects (C2 and C3), the percentage of 
subjects incorrectly identified rose to an 
average of 18.595. When the weights 
derived from this criterion population 
were applied to a cross validation group, 
the percentage of subjects incorrectly 
identified at the best critical score of the 
cross validation group became 25%. 
When a critical score derived from the 
criterion population was employed, the 
average percentage of subjects incorrectly 
identified rose to 28%. 

Zubin (60), in another article evaluat- 
ing screening tests, presents the results 
obtained at the Sheepshead Bay U. S. 
Maritime Training Station as an ex- 
ample of the marked effectiveness of a 
questionnaire in military screening 
which generally has produced better re- 
sults than civilian screening. Of 1,000 
men processed, 660 were passed without 
a further interview on the basis of a 
screening personality questionnaire. 
Three of those passed had to be dis- 
charged from the service during their 
training period for psychiatric reasons. 
Of the 340 men selected for a psychiat- 
ric evaluation, 15 had to be discharged 
from the service, 75 had temporary or 
mild emotional conditions which re- 
quired short term therapy, while the re- 
maining 250 turned out to be false posi- 
tives. 

In evaluating the above data, the 75 subjects 


who adjusted after short term therapy are diffi- 
cult to place. There may have been an equal 
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proportion of subjects among the persons passed 
without a psychiatric interview who had com- 
parably mild emotional conditions, but who 
were able to adjust without any short term 
therapy. If these 75 subjects were therefore to 
be considered as false positives, there would be 
а total of 1.000 subjects, 982 passing and 18 
being discharged. Of these, 33% (325 passing 
and 3 discharged) were incorrectly. identified. 
Eliminating the 75 subjects from the data would 
lower the percentage of subjects incorrectly 
identified to 27%. 

Not only are the above percentages about the 
same as those obtained with the SMCR, but 
they probably give a spuriously high estimate 
of validity. Only persons making high question- 
naire scores would have been discharged (on 
the basis of a psychiatric examination) without 
participating in the regular training program. 
Ellis and Conrad report a study by Weinstock 
and Watson "on 121 Naval recruits who were 
allowed to remain in service (‘on trial’) despite 
an adverse prognosis based on clinical judg- 
ment. Of the 121 recruits, only 44 (36%) were 
discharged for neuropsychiatric reasons during 
recruit training” (10, р. 386). ^ 


Another factor which frequently leads 
to spuriously high validity estimates is 
the use of extreme groups. With such 
populations almost any screening meas- 
ure can obtain a fair validity. As Huston 
and Shakow found in a study on schizo- 
phrenics, while the patient groups per- 
formed at a level significantly poorer 
than the control group on two indices 
of steadiness, "when these patients who 
cooperated as well as the normal group 
were compared with the latter, the differ- 
ences between the 8roups disappeared” 
(25, p. 120). It was for this reason that in 
the present study care was taken to test 
only the “best” schizophrenics, and to 
use anxiety reactions who were not so 


severely disturbed as to Warrant hospital- 
ization, 


renic subject 
filled out Tests A and wma ae 
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cording to the established criteria, but 
who failed to fill out Test С properly. 
The greater failure on Test C resulted 
from the fact that it required a some- 
what more complex response and came 
last in the testing battery when increased 
boredom and fatigue had set in. When 
the scores of the 48 schizophrenics who 
failed to complete Test С were substi- 
tuted for the regular schizophrenic cross 
validation population in the Test AB 
comparisons, the percentage of subjects 
incorrectly identified at the best critical 
score dropped from 24% to 10% in the 
N-S comparison, and from 32% to 22% 
in the AR.S comparison, Furthermore, 
had the SMCR been administered so 
that a higher Proportion of schizophren- 
ics would have responded, such as under 
individual conditions or with proctoral 
assistance, the percentage of subjects in- 


correctly identified would undoubtedly 
have been even smaller, 


B. INDICATIONS FOR GREATER POTEN- 
TIAL DISCRIMINATION 


While there has been a tremendous 
amount of experimentation with ques- 
tionnaires Extending back to the first 
World War, relatively little work has 
been done with multiple choice projec- 
tive tests, Indications exist from the pres- 
ent study that further research with the 
multiple choice projective approach may 
lead to a marked increase in the amount 
of discrimination produced. 

Firstly, the present study did not at- 
tempt to exhaustively sift questions until 
а test composed of the most discrimina- 
tory choices was evolved. By far the 
greater number of responses received no 
Weights, even under the most liberal 
weighting system. Yet the greater amount 
of discrimination occurring as the size 
of the test Was increased clearly indicates 
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that the replacement of noa-discrimina- 
tory percepts should add even more to 
the validity. 

Secondly, only one projective method, 
the Rorschach, was used. It is recognized 
in individual psychopathological testing 
that much more validity can be obtained 
from a battery of different projective 
tests than from any single instrument. 
As Rapaport and Schafer have stated: 
"... as an exclusive basis of diagnosis 
the test (the Rorschach) has serious limi- 
tations and its real potency is exploited 
only when used as the center of a battery 
of tests" (42, p. 78). 

Thirdly, a loss in validity undoubtedly 
Occurred because of inadequate diagno- 
ses of subjects in the three diagnostic 
classifications. The Cornell Index, which 
Was used as the criterion of normality is 
itself only a rough screening device, and 
it has not proved too effective in the 
screening of prepsychotic and early 
psychotic states. Norms for Form Ni of 
the Cornell Index could not be obtained. 
However, 10 or fewer poor responses on 
Form N2 which contains 37 additional 
questions and consequently provides 
more opportunity for poor responses, 
will be made by 14% of the persons re- 
jected for military service for psychiatric 
reasons. Thus, it is probable that some 
subjects in the “normal” population 
were poorly adjusted. Furthermore, some 
of the schizophrenics were in remission, 
or at least well enough adjusted to be 
released from the hospital within a few 
weeks of their testing, while others had 
à great deal of free floating anxiety char- 
acteristic of anxiety reactions. The anxi- 
ety reaction group was perhaps the most 
Poorly diagnosed, containing many indi- 
viduals who would not have contem- 
plated psychiatric treatment had they 
not been veterans, as well as persons 


whose anxiety was symptomatic of a 
latent schizophrenia. Almost all the 
anxiety reactions in the cross validation 
population were selected only on the 
basis of a screening or tentative psychia- 
tric diagnosis. 

Of course, most subjects in each diag- 
nostic category were probably correctly 
classified. However, the number of sub- 
jects incorrectly identified by the test was 
relatively small, so that the inclusion of 
some misclassified persons in this group 
would make an appreciable difference. 
It is recognized that the misclassified 
subjects may not be the persons incor- 
rectly identified by the test. However, 
the possibility appears to be sufficiently 
great to warrant a more thorough evalua- 
tion of the subjects in each diagnostic 
category before the maximum potential- 
ity of the test can be estimated.? 


!Since a year had elapsed between the testing 
of the subjects and the preparation of this 
monograph for publication, an attempt was 
made to check the accuracy of the diagnosis 
established at that time by a follow-up study. 
No further information could be obtained for 
the normal subjects since they had filled out the 
tests anonymously. However, psychiatric progress 
reports were available for most of the anxiety 
reactions and schizophrenics. : 

The majority of the anxiety reactions tested 
were subsequently treated by psychiatrists work- 
ing in private clinics under contract to the Vet- 
erans Administration. These psychiatrists pre- 
pared monthly reports summarizing the condi- 
tion of their patients, The majority of the schizo- 
phrenics had been tested at the Veterans Hos- 
pital, Northport, L. L, and for these subjects 
complete clinical case folders were available. 

Unfortunately, a careful inspection of this 
data was not helpful in improving the validity 
of the original diagnosis. While almost all of 
the subjects retained their initial diagnosis, a 
consideration of their symptomatology revealed 
much overlapping between subjects diagnosed 
as anxiety reactions and those diagnosed as 
schizophrenics, It appeared that psychiatrists at 
out-patient clinics generally preferred to make 
a diagnosis of anxiety reaction while those at 
neuropsychiatric hospitals were more likely to 
make a diagnosis of schizophrenia. This is in 
agreement with the experience of the writer at 
both types of installations, and with the opinions 
of several psychiatrists consulted on this ques- 
tion, Thus, subjects with identical symptoma- 
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Finally, a multiple choice projective 
test combined with other types of tests 
in a battery may result in a greater 
amount of discrimination than would 
exist for any of the tests separately. For 
example, a test such as the SMCR which 
resulted in greater discrimination be- 
tween normals and psychotics might be 
combined with a questionnaire such as 
the Cornell Index which is more effective 
in differentiating normals from neu- 
rotics, Subjects might also be given an 
intelligence test, and their multiple 
choice projective test responses inter- 
preted according to their LQ. It is recog- 
nized that responses on the free associa- 
tion Rorschach will depend upon a sub- 
ject's intelligence as well as his psycho- 
pathological condition. A "vague" or 
"mediocre" response which might be 
considered normal for a subject of low 
intelligence would be indicative of mal- 
adjustment in one of high intelligence. 
The same would appear to be true of the 
multiple choice Rorschach. In the pres- 
ent investigation, the large difference in 
education between the anxiety reactions 
identified incorrectly as normals and 
those identified incorrectly as schizo- 
phrenics probably reflects a différence in 
intelligence rather than in education per 
se. Therefore, it might be advisable to 
develop different sets of Scoring weights 
for different levels of intelligence, 


C. SUGGESTIONS AS TO FUTURE RESEARCH 
From the results of the present study, 


both quantitative and qualitative, a 


tology might be diagnosed differently at different 
installations. 

A further attempt to validate the SMCR on 
schizophrenics by comparing the scores of sub- 
jects who had been discharged with those who 
still remained in the hospital was abandoned 
when it was found that in a population of the 
“better” schizophrenics, discharge was deter- 
mined as much by the patient's home environ- 
ment as by his psychiatric condition. 


number of Suggestions arise as to the 
course of future investigations. 


1. Subjects 


In the present study, weights based on the 
results of only 5o subjects in each diagnostic 
category proved inadequate. On the other hand, 
a criterion population of 150 subjects per diag- 
nostic category was no more effective than one 
of 100 subjects. But caution would be necessary 
in applying these results to other investigations. 
For example, a smaller number of subjects might 
be required when only two or three responses 
are possible for each question rather than the 
5 or 6 in the SMCR, or when a psychopathologi- 
cal variable less heterogencous than diagnostic 
categories is investigated. However, the lack of 
Very marked differences between diagnostic 
Broups in their responses to any question in 
the SMCR indicates that a fairly large number 
of subjects will probably be required to estabiish 
reliable scoring weights, 

Not only should the subjects in both criterion 
and cross validation populations be very com- 
petently diagnosed, but it would be valuable 
to have them evaluated psychiatrically on a 
wide variety of variables. In addition to noso- 
logical classification, the subjects could be rated 
upon significant personality traits, such as hos- 
tility or dependence, and upon dynamic factors 
Such as sibling rivalry or parental rejection. 
Then, the same subject population could be 
Tegrouped accordingly when studying each dif- 
ferent psychopathologica] variable. A prognostic 
index could also be developed by reconsidering 


the data after the Progress of the subjects had 
been observed, 


2. Stimuli 
Further use of the Rorsch 
stimuli in preference to other 
seem inadvisable. As 
shown, there is little co 


the psychopathologica] igni 
between a priori j 


ach inkblots as 
inkb!ots would 


s Particular inkblot der free 
“a я О» s under 
cation test conditions would be of little 


tion of a multiple choice 


< Performance on 
tion Rorschach.» 


? For thi 
SMCR ped ar the scoring stencils for thc 
report. It was felt j, included in the present 
Ct that the availability of these 
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. While no quantitative evidenée is available, 
it was the experience of the writer and of sev. 
eral other clinical psychologists who tested sub- 
jects individually subsequent to the SMCR that 
the performance of these subjects had been 
influenced by the preceding multiple choice 
Rorschach, The evaluation of the subjects’ free 
association protocols was hindered by the diffi- 
culty in assessing the direction and extent of the 
influence of the multiple choice test. 

Another advantage in using other than the 
Rorschach inkblots is the opportunity to con- 
struct inkblots particularly suited for a multiple 
choice test. Such blots could be structured to 
emphasize responses to cither large arcas or the 
blot as a whole. In the present experiment, the 
subjects rarely studied the slides minutely, but 
based their responses on a broad general im- 
pression. (Of course, it is possible that a more 
detailed examination of the stimuli would have 
been made had the subjects been more highly 
motivated or had the inkblots been presented 
in booklet form.) “he inkblots could be struc- 
tured to emphasize certain determinants, such 
аз Levy's graduated "movement" series. (61) , or 
they might present а choice between determi- 
nants such as by containing two possible per- 
cepts, one form determined and the other 
predominantly color determined. It may be 
desirable to draw inkblot designs with printer's 
ink, as was done by Terman and Miles (52), 
rather than to use actual inblots. While it would 
be desirable to explore the potentialities of in- 
definite figures placed on simple backgrounds, it 
is very possible that better results are obtained 
when both figure and ground are indefinite, as 
with inkblots. This reservation should be kept 
in mind for all types of projective stimuli. 

In addition to inkblots, various other pro- 
jective stimuli should be investigated. Here 
again, it would not appear advisable to use the 
same stimuli as in the free association tests, 
but rather to develop new stimuli which to- 
gether with the multiple choice responses could 
be structured in the direction of a specific 
Psychopathological variable. For example, rather 
than employing the TAT cards, special pictures 
could be drawn. To investigate the maternal 
relationship, a picture of a mother and a child, 
the former with her arm upraised, might be 
shown. The responses could include one in 
which the mother is petting the child, another 
in which she is about to strike the child, and 


stimuli might lead to an indiscriminate use of 
the test, thus exposing many persons to me: 
gested responses with the Rorschach blote Ше 
Scoring stencils will, of course, be made availa» e 
to any qualified psychologist who wishes to per- 
огт a confirmatory or further exploratory ex: 
periment, 


a third in which she is waving to someone be- 
hind the child. Of course, the picture would 
have to be sufficiently vague so that the subjects’ 
responses would depend upon their projections. 
With the Rosenzweig Picture Frustration Test 
(43. 44) which is already structured to measure 
a specific variable, that of frustration tolerance, 
it might be necessary only to develop a set of 
multiple choice responses for each of the pic- 
Lures. 

Geometric figures, such as those employed in 
the Bender Gestalt (1) and the Franck Symbolic 
Sex Preferences (11) could be used. Here, the 
alternatives might consist of other geometric 
figures, with the subjects required to select the 
figure most closely resembling the stimulus. 
Perhaps particularly well suited for the detec- 
tion of paranoid tendencies wou!d be the Tauto- 
phone (13) in which a recording of low-voiced 
nonsense syllables is played. Subjects are asked 
what the recording is saying, being given to 
understand that it is producing a meaningful 
statement. 

Further work can be done with story plots 
along the lines already developed by Kelly, and 
Bishop, Taylor and Wolfenstein (27, 50, 59). 
Verbal stimuli could also be confined to single 
words in a variation of the word association 
approach. Either fragmented letters or very brief 
exposures during which only one or two letters’ 
would stand out might be employed, with sub- 
jects being asked which word had been dis- 
played. 


3. Administration of the Test 

It would seem desirable to present the stimuli 
as tests of ability, and in fact, to include some 
questions with obviously correct responses to 
further this belief. The best exposure time would 
have to be worked out separately for each dif- 
ferent stimulus. While the results of the present 
experiment indicated that the length of exposure 
of the Rorschach slides made little difference 
in the amount of discrimination obtained, no 
conclusion can be reached as to whether this 
will hold true for other stimuli or for othcr 
psychopathological variables which may be in- 
vestigated. 

It may be possible to diminish the effect of 
suggestion and rationalization in the selection 
of responses and thereby obtain more significant 
projections by removing the stimulus before the 
multiple choices are presented to the subjects. 
For example, in the SMCR this could be done 
by listing the percepts in a question on a second 
slide which would be proj cted on the screen 
after the stimulus had been removed. The sub- 
jects could indicate their choice of response by 
marking the appropriate column on an IBM 
answer form, With this technique, the subjects 
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might be forced to reply more upon their initial 
free associative responses to the stimuli. 


4. Structure of the Questions 


No definitive conclusions could be reached 
from the results of the present study regarding 
the structure of the questions. Similar results 
were obtained when the subjects made a relative 
decision by selecting one of four percepts and 
when they made an absolute decision by rating 
a percept on an absolute five point rating scale. 
Differences due to variations in question “de- 
sign” (that is, the types of percepts contained 
in a question) could not be investigated as 
originally planned because the psychopathologi- 
cal significance of the percepts had been deter- 
mined on an a priori basis, and there proved to 
be little correlation between a priori judgment 
and the empirical results. However, little dif- 
ference was found between “forced choice" 
questions in which subjects had to select a 
specific percept, and “unlimited choice” ques- 
tions in which they could check the alternative 
“Something else.” 

Because of the lack of relationship between a 
priori judgment and empirical findings, it would 
seem advisable in subsequent experimentation 
to determine first, insofar as is possible, the 
psychopathological significance of each response 
empirically from a simple three or four point 
rating scale. Then, after the significance has 
been established empirically, the percepts could 
be fitted into the structures of different question 
designs. These designs could extend from the 
relatively simple one suggested by Malamud and 
Malamud (32) in which the subjects select either 
one of two percepts listed or the alternative 
“neither,” to a design with as many as 10 
percepts, as in the Harrower Multiple Choice 
Rorschach (19). 
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95. Length of Test 


It would seem desirable to have the test as 
long as possible. The present experiment demon- 
strated that the amount of discrimination in- 
creased with the length of the test up to at least 
160 questions, and it may very well continue 
beyond that point. With the use of a variety 
of stimuli rather than a repetition of the same 
19 inkblots, the subjects should not become 
bored or fatigued as quickly. Moreover, it was 
felt that the discriminating power of a stimulus 
decreased as additional questions were applied 
to that stimulus. Therefore, using a higher pro- 
portion of stimuli to questions than in the 
SMCR may also serve to increase the validity. 


6. Scoring of the Test 


Since the five weighting systems employed in 
the present investigation all produced about the 
same amount of discrimination, it would not 
seem to make much difference which weighting 
system were to be used. However, in addition 
to the simple weighting of items, it may be 
advisable to make a Guttman scale analysis (6) 
ОЁ the responses, Few psychopathological vari- 
ables аге "pure" entities, This is particularly 
true of nosological categories, For example, psy- 
chiatrists now prefer to Speak of the "schizo- 
phrenias” rather than of "schizophrenia," be- 
lieving that this diagnosis e 
of only loosely 
many divergent 
lead to the sym 
reaction. A scale ana 


but which do not r 
total score when 
equally, 


е VII. CONCLUSIONS 


1. Further exploration with the multi- 
ple choice projective test approach in the 
detection of psychopathology would seem 
warranted since: 

A. A fair validity has ben obtained 
with a multiple choice projective test. 
When scoring-weights based on a sample 
of 100 subjects in cach diagnostic cate- 
gory were cross validated on a new 
sample of 5o subjects in each category, 
the percentage of subjects incorrectly 
identified by the SMCR was at most 
30% for the normaLanxiety reaction 
comparison, 2595 for the normal-schizo- 
phrenic comparison, and 30% for the 
anxiety rcaction-schizophrenic compari- 
son, There was an error of under 17% 
in predicting which of the three diag- 
nostic categories a subject would not be 
їп, 

В, Split half reliability coefficients, 
When corrected by the Spearman-Brown 
prophecy formula, were .77 in the nor- 
mal-anxiety reaction comparison, .87 in 
the normal-schizophrenic comparison, 
and .72 in the anxicty reaction-schizo- 
phrenic comparison. 

C. There are grounds for believing 
that even greater validity may be ob- 
tained by: 

(1). Further experimentation with 
various multiple choice projection tests. 
(2). Combining a multiple choice 
projective test with a questionnaire and 


an intelligence test in a screening battery. 
IL Of the different test conditions 
varied in the present experiment, only 
the length of the test was shown to be 
significantly related to the amount of 
discrimination obtained. 
ПІ. No significant differences in the 


amount of discrimination could be 
established from: 
A. The application of different 


methods of weighting responses. 

B. Increasing the size of the cri- 
terion population upon which the 
weights were developed from 100 to 150 
subjects in each diagnostic category. 

C. The subjects responding by mak- 
ing a "relative" as opposed to an “ab- 
solate” decision. 

D. The subjects being forced to 
choose among specific percepts in a 
question as opposed to being given the 
"unlimited" alternative, “Something 
else.” 

E. Exposing the stimuli for a brief 
period as opposed to a lengthy period. 

IV. There was little correlation be- 
tween a priori judgments based on the 
free association Rorschach and the em- 
pirical results obtained with the SMCR 
with respect to the psychopathological 
significance of responses. 

V. Both age and education, the latter 
perhaps a reflection of intelligence, may 
affect the subjects’ responses. 


VIII. APPENDIX 


A. THE SINGER MULTIPLE CHOICE RORSCHACH 


NAME 


Test A 
AGE 


e 
——A wolf's head 

— An emblem 

A topographical map 
An octopus 


i 
—— l bomb bursting 
———Lungs and blood 
— Two dogs 

— — Something else 


-7- 
Two red roosters 

а Ribs 

A fish, 


Lungs and chest 


Sip 
— Flag of a country 
Mountain range 
A black waterhole 
— Spilled ink 


و 

————А pair of antlers 

———А butterfly with torn 
wings 

Dried blood 

— Something else 


б 
———A spattered mess 
————А totem pole 

А backbone 
—— A turtle 


—19- 
— Intestines 

—lce formation 

A squashed frog 
—— topographical map 


s= ge 
Two bears climbing 
A biology slide 

— —— A sailboat 

— —— Something else 


а= 
— Ruins of a castle 
A blot 

——A squashed bug 
An X-ray picture 


=5= 
Animal heads 

Torn up butterfly 

A map 

Bears’ heads 


E 

Vertebra 

X-ray of the hips 

Person tearing someone 
apart 

Blood and black 


-j 
———А cow with fur 
—— Two feet 
An X-ray 
— — Something else 


—4— 


—— Legs 
—A horrible object 
А map 
A butterfly 
—17— 
——Ап ой explosion 
————А bad design 


——A cut up mal 


€ Sex organ 
— Black paint 


=so— 
~——Two heads 
——A coastline 
— —-Nothing at all 
- — Something else 


—93— 
——AHead of small dog 


m Two polar bears climbin 
——N Easter egg > 


— —— Modern art 


34 


— GRADE... DATE 


و 
An animal split open‏ — 

— — Line down the center 

А butterfly j 
Something else 


-6-— 
Two equal sides 
Black and red paint 
— — Dark clouds 
———A large суе 


-j- 
Smoke 

—- —— hair ribbon 
— — Nothing at all 
Something else 


moms 
——Necks of geese 
— Two scottie dogs 
————А spinal column 
— —— Sex organs 


—15— 
————A line down the center 
— ——AÀ bug 

——A black mess 

— —— Nothing at all 


-—18-— 
————Ап insect 

— — Jack the Ripper 
—А riverbed 
— — Something else 


c 

—— butterfly 

— Grass 

————А clown's face 

— —— People watching each 
other 


وو 
Colored butterflies‏ —— — 
n X-ray picture‏ — 


— ——An airplane f 
—— A blotch of colors 
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— Flowers 

— — Colored clouds 
— — Birth of a child 
— Sea horses 


„ай 

— — Male sex organs 
— — Leaves 

— — Bugs 


People hanging in air 


ы — 26 — 

—— A dirty mess 

Р Smoke rising 

— — — A bridge in the distance 
-  — Nothing at all 


—29- 


— — Bones 
— — Two spiders 
— — Something else 


Red and blue and yellow 


A person cut up 
A spinal column 
A halloween mask 
Something else 


—30— 

— — —A blue feathery flower 

A terrible germ 

Spiders attacking 
somebody 

— — Deer running 


Test B 
NAME. -— — AOE GRADE DATE = 
=1А – —а@А— —3A— 
—N bat -Burnt piece of paper ——A coastline 
- Female sex organs — — A spine — —-— A dark cloud 


-A staring mask 
—— Something else 


=A= 
— Fire 

— — Two people 

— — A pair of pliers 
— — Smoke 


—7A- 
— — Sex organs 
— — A skeleton ellect 
-Two staring eyes 
—-— Picture from a medical 


book 
— 10A — 
— A giant walking towards 
you 
— Two snakes 
———A furry man 
— — Something else 
—13A— 


— .Head of a mouse 
-Profile of a man 
Antennae 

—А dancer 


NEZ 


An X-ray 
А squashed worm 
— — Water 


—agA-— 
—- A Pacific island 
‘Two women go siping 
~ Male sex organs 
- Something else 


— —A cat's face 
— — —Two people 


—5А— 
— .Human chest with a 
hole in it 
— A squashed bug? 
e Blood 
— — A coastline 


РЕ УРРА 
Waiters bowing 
— A symmetrical design 
— — Animal cut open 
- — Something else 


== 
_ — A dragon skin 
A man sitting down 
— — A gorilla 
— — Snakes 


= ЧА — 
Two buffaloes crushing 
a rabbit 
Two equal sides 
Night 
— — Islands 


— A = 
— Nothing at all 
_ — Picture from an airplane 
— — Flying geese 
. — Something else 


— 90A — 
. А rabbit's head 
Clouds 
Scoitie dogs 
_ — Small dots on an island 


— leaf 
—— Death 


— —— Entrance to a tunnel 
Nothing at all 


— — Something else 


د دو 
—Two devils‏ — 
-Two monkeys hanging‏ 
down‏ 
A spinal cord‏ — = 
Birds’ heads‏ — — 


— 12A — 
— — X relief map. 
— — —Clouds 
A torn chest 
Night 


=A= 

— Dark smoke 

— À Bait 

— The nervous system 
Something else 


= (BÀ 
— — Head of a dog 
— — —A dragon fly 
ÀX cloud 
—A furry snake 


—21A— 
Smoke 
—— Just an inkblot 
A large eye 
a Fog or mist 


36 MARTIN. SINGER 
о 
rU EA — 33А — —24A — 
A spinal column ____Pink, blue, orange — ——Two fish | 
-_ Decorated Christmas tree Snow covered mountain — .  lustration in a medical 
—— — Flowers — — Male sex organs book 
Ribs — ———A mountain в -A coat of arms 
— — Something else 
دوه‎ — 26А — —27A— 
A famous painting — — —The liver ~ — Horns of a deer 
тз Staring еуез — — Blood - — .-Àn explosion 
Red, green, orange — — Two women — —— Intestines 
— — Drawing in a medical —— Something else — — —A man's face 
book 
ЕВА, = 2= — 30А — 
— — Squashed insects — 4 tobacco pipe —— — A Christmas tree 
— — A wishbone — —-—Two spiders — Modern art 
. — Snakes eating a rabbit's " Deer running E: Bloody insides 
head - — Mountain sides — — llustration of different 
— — Something else flowers 
Test C 
NAME. — - —— AGE =s DEDE. 
VERY ONLY A NOT 


EXACTLY MUCH SOMEWHAT LITTLE AT ALL 
LIKE IT LIKE IT — pikg rr. LIKE IT. LIKE TT 


f A а АР uv. ris pera renim e 
2. Two Santa Clauses 
3. Nothing at all .... 
4- À bat ... 

5. Female sex organs ...... 

6. Woman with her arms raised 
7. А staring mask 
8 
9 
о 


. Small islands ... 
. A squashed bug 
. Ruins of a castle 


—H— 

11. A man with his head cracked open 
ESL DUCE aus ds aaa en ataca 
13. Two clowns playing pattycake . 

14. Nothing at all ........ 3 
15. Entrance to a tunnel . 
16. Faces on the edges . 
17. Two equal sides .... 
18. An arrowhead ..... 
19.-À large eye esis "E 
го. A white lamp in the center .......... 


— I — 

21. A person tearing someone else apart .. 
22. A bloody mess 
29. А coastline ...... 

24. A symmetrical design . 
25. Waiters bowing .. 
26. Men picking something up 
ЖЖ ВОЙ Tie ala notre 


28, 
29. 
3o. 


81. 
32. 

33. 

' 34. 
35 

> 36. 


37 


38. A fur rug 


39. 


40; Two boots ......................... SS ayu sss: 


42. 


44- 
45. 
46. 
47. 


48. А mountain range 
49. Nothing at all 
50. A nutcracker 


57. Flying geese .. 


48. A rabbit „оо... еннет А 
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VERY ONLY А NOT 
EXACTLY MUCH SOMEWHAT LITTLE AT ALL 
LIKE IT LIKE IT LIKE IT LIKE IT LIKE IT 


A butterfly .... 
Vertebra .... 
Nothing at all . 


=i 

Head of an animal .. 
А furry man . 
Two snakes ...... س‎ 
A skin stretched out tight 
A mountain range 
A black waterhole .. 
Nothing at all 


A giant walking towards you ........ 


The nervous system 
A line down the center . 


A smashed bod 
Dark smoke .....- —— 
А bát, аанак ане 
Ап insect, xev item emnt eie —N GG GÀ 


=й» 


. Nothing at all ..................... 
. Leg of a table .. 


A snake with feathers .. 


. Picture from an airplane .. 
. Black paint ...... 


A cat's whiskers .. 


58. A butterfly eee 


A 


71. Animals climbing 


. Two women gossiping - 
. Grass 


. A white lamp ..- 
. Two female faces 


59. An oil explosion ....- 
бо. Rug of animal skin 


za 


. A topographical map 


. Male sex organs 


. A Pacific island .. 
- Nothing at all ... 
. A necklace ....-- 
. Rocks balancing «00000077 


— VIII — 


. A coat of arms .. ` 
- Nothing at all . ye 
a TWO ASR кар» iar sif ertt 


. Blood 
. A blotch of colors 


. Frogs’ heads ....................... s kaa sss 
. Colored butterflies . — 
. Modern art .........- .... — 
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VERY ° ONLYA NOT 


EXACTLY MUCH SOMEWHAT LITTLE АТ ALL 
LIKE IT LIKE IT LIKE IT LIKE IT LIKE IT 


. An evergreen tree .. 


= 
. Nothing at all 
. The liver ...... 


. A burning candle . 


; ВІООЙ „гео š 
. Two men in dunce — M و‎ 
. Two women 
. Smoke rising ...... 
. Bloody hands .... 
. А colored map TS 
Mu NT ERN E ина рр ас ар айнан на ыдын 


95. Snakes eating a rabbit's he 


ре 
. A biology slide .... ss 
. Leaves .............. 
. A wishbone ......... 
. Nothing at all . š 


ad . 


96. Flowers ................. м in Ni Salsa cte TUA ga 


97. Two lions 
98. Two people hanging in 
gg. A rabbit's head . 
100. Squashed | insects 


ою 


В. Six-ITEM QUESTIONNAIRE TO SCREEN NORMALS 


Answer each question ҮЕ$ or NO. If you are not sure. 
a , guess, 


. Have you ever had treatment for a nervous condition? 
+ In general, do you consider yourself a nervous person? 
. While in the service, were you ever hos j 


Жы ae pitalized fo; ji rosi 
or some similar condition? T psychoncurosis, combat fatigue 


. Since you have gotten out of the rvi 

s ^ service, have y ы 

irritable ог jumpy? * you generally (not merely occasionally) felt 
‚ Do you get upset more easily than the 
. Have you ever fi 


üverage person? 

led a disabili i r i 

bility claim or collected а pension for psych 
Sychoncurosis? 


nervousness 


ee‏ ی 


Lr 


tx 
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. Franck, K. Preferences for sex sy 


- Grincs, W. W. 
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A NORMATIVE STUDY? OF THE THEMATIC APPERCEPTION TEST: 


I. INTRODUCTION 


A NUMBER of studies have indicated 
that the ‘Thematic Apperception 
Test is a valid instrument for measuring 
the usual product of fantasy central to 
an individual (25, 36, 40, 49), and thus 
many investigators have assumed that 
this technique can be used as a basis for 
diagnostic differentiation among clinical 
groups. Preliminary investigations by 
this writer, however, have indicated that 
no broad group dillerences could be 
elicited, at least in respect to content (16) 
and emotional tone (19). and that hos- 
pitalization itself might be an important 
influence in determining the nature of 
the productions (19). It was also dis- 
closed in these studies that many of the 
diagnostic interpretations placed on cer- 
tain characteristics of TAT stories did 
hot apply to the groups of subjects they 
included and that there thus was a 
definite need for normative data compar- 
able with the Rorschach tables of Beck 
(6) and. Hertz (27). Were such data avail- 
able to all who use the TAT, many false 


‘The data included in this monograph were 
collected in part in connection with à Ph.D. 
dissertation, An investigation of the jantay fr 
ductions of college students and neuropsychiatric 
patients, University of Wisconsin, 1949- 


impressions about certain records would 
be corrected; and there would be less 
danger than there has been of the ex- 
aminer’s projecting his own biases into 
personality interpretations constructed 
on the basis of the TAT. Furthermore, 
the dangerous practice of “cue” analy- 
sis; often based on faulty analogical 
reasoning (17), and susceptible to misuse 
by the inexperienced, might be consider- 
ably curtailed if the examiner had re- 
course to adequate normative data. 

]t is the purpose of the present re- 
scarch to clarify three fundamental ques- 
tions which were raised by these prelim- 
inary investigations: 

1. Do the fantasy productions of per- 
sony suffering from behavior disorder 
resemble those of persons not diagnosed 
ill? 

2. Does the restricted environment of a 
psychiatric hospital affect systematically 
the fantasy productions of patients? 

3. How can the material obtained from 
the TAT be standardized to furnish ade- 
quate norms for use in a clinical situa- 
tion? 

2The use of specific characteristics of form 
and content of the subject's productions as path- 


ognomonic indicators for a given clinical syn- 
drome or dynamic factor. 


П. PROCEDURE © 


A. SUBJECTS 


HE SUBJECTS ОЁ the present study 
| 7 of 150 male veterans of 
World War II, divided into six groups 
of 25 each, which included: Groups C1 
and Сэ, non-hospitalized college stu- 
dents; Group NHPN, non-hospitalized 
psychoneurotics; Group HPN, hospital- 
ized psychoneurotics; Group HS, hos- 
pitalized schizophrenics; Group GNPH, 
general neuropsychiatric hospital popu- 
lation. The clinical groups contained the 
following subclassifications (all final di- 
agnoses rendered by a board of at least 
three psychiatrists): 


1. Group NHPN, Non-hospitalized Psycho- 


neurotics: 
anxiety ' 19 
mixed 3 
anxiety, conversion reaction 1 
anxiety with depression 1 
obsessive with anxiety 1 


rj 


. Group HPN, Hospitalized Psychoneurotics: 
anxiety 14 
mixed 7 
anxiety, conversion reaction 1 
anxiety with depression 1 
anxiety, sex deviate 1 
anxiety, emotional instability 1 


= 


Group HS, Hospitalized Schizophrenics: 
unclassified 
simple 
catatonic 
paranoid 
hebephrenic 


14 


M من‎ оо оо = 


4. Group GNPH, General 
Hospital Population: 

schizophrenic, paranoid 
schizophrenic, catatonic 
schizophrenic, unclassified 
schizophrenic, simple 
incipient schizophrenia 
manic depressive psychosis 
psychosis, unclassified 


Neuropsychiatric 


- юю ны DE 


* Out-patients seen at Madison VA Mental 
Hygiene clinic; in-patients seen at Mendota VA 
Hospital. The writer wishes to thank Dr. R. W. 
Trent and Dr. A. Bonar for their co-operation 


in permitting him to see patients at these facili- 
ties. 


psychoneurosis, anxiety 
psychoneurosis, conversion reaction 
reactive depression with alcoholism 
transvestism 

inadequate personality 
psychopathic personality 

drug addiction 

encephalopathy 

post epileptic 

observation 


5 = سے = = = = = = 


The reasons for the breakdown into 
Six groups of 25 each were to both sim- 
plify the statistical procedures (i.e., per- 
mit the use of frequency data in compari- 
sons without converting to percentages) 
and to provide experimental control 
over any group characteristics which 
might emerge, If significant differences 
occurred, we could see: (a) whether they 
Were more extensive than those between 
two unsystematically selected normal 
groups; (b) whether hospitalization itself 
accounted for some differences; (c) 
whether certain group characteristics 
could emerge even from a heterogeneous 
population (such as that constituted by 
Group СМРН). One of the shortcomings 
of the studies in the use of various tests 
in clinical diagnosis has been that not 
only do the diagnostic patterns derived 
from the examination of test protocols 
select individuals of the specific clinical 
group desired, but they also pick out in- 
dividuals of every other clinical category. 
In this study we have a check on whether 
the mere grouping of individuals into 
clusters of 25 does or does not in itsclf 
make for significant group characteris- 
tics. 

The groups were distributed according 
to age, education, IQ, and marital status 
as shown in Table 1. They are compara- 
ble with respect to these variables, al- 
though Group GNPH is least homogene- 
ous and has more extreme individuals 
than the other groups. 


А NORMATIVE STUDY OF THE ТАТ 3 
? TABLE 1 
AGE, EDUCATION, IQ, MARITAL SrATUS OF SUBJECTS 
Age Education IQ Marital Status 
Group* N M а 
A ar- Di- 
Range Mean Range Mean Range Mean xied osea 
Ст 25 20-36 25.9 13-17 14.4 7 
C2 25 20-35 25.8 13-18 14.5 7 
NHPN 25 20-34 25.4 12-16 13.5 103-128 114.2 7 2 
HPN 25 19-35 20.1 12-15 13.0 100-126 111.3 IS 
HS 25 20-30 24.2 12-17 13.3 100-133 112.9 б 
СМРН 25 20-49 30:6 9-17 12.1 00-128 100.7 9 2 


^ The following notation will be used throughout the Tables and Appendices to identify the various 


groups: 


Ci—First group of college students 
C2—Second group of college students : 
NHPN—Group of non-hospitalized psychoneurotics 
HPN—Group of hospitalized psychoneurotics 
HS—Group of hospitalized schizophrenics | I 
GNPH—Group of general neuropsychiatric hospital patients. 


No 10 scores are available for the college stu- 
dents, but it is assumed they are at least of 
normal intelligence. It has been reported in the 
literature (46) that, after a normal level of in- 
telligence has been reached, there is no corre- 
lation between quantity or quality of response 
on the TAT and added increments of intelli- 
gence. Indeed, one investigator states that stories 
With satisfactory content for analysis have been 
obtained with subjects with IQs as low as 80 
and consistently satisfactory stories were ob- 
tained from subjects with IQ of go and above 
(28). Since the IQ of no subject in the clinical 
groups is below normal, it is felt that this factor 
has been adequately controlled. 


B. MATERIAL AND ADMINISTRATION 


All 20 cards recommended for adult 
males (Harvard University Press, grd 
Edition) were used. Administration of 
the test was individual and the only ones 
present in the testing room were the 
Subject and examiner. The examiner 
took verbatim notes, as far as possible, 
of everything that the subject said. Al- 
though the authors of the test suggest an 
interval of one week between the admin- 
istration of the first and second 10 cards 
(41), hospital and clinic routine did not 
Permit such a procedure in this study. It 
Was therefore necessary to administer the 


whole series at one session with a ten- 
minute break between halves. This 

e 
method has been recommended by cer- 
tain investigators (46). Directions to all 
the subjects were identical and as fol- 
lows: 

This is a test of imagination. Here are some 
pictures about which I would like you to make 
up some stories. You are to tell what happened 
before, what is happening in the picture, and 
how it's going to turn out, what the characters 
are thinking and feeling. Take about four or 
five minutes for cach one but be sure to include 
all these things. All that we want is a plot for 
each picture. No elaborate literary masterpiece is 
necessary. This is the first one, see what you can 
do with it. 

Each subject was prompted when any 
of the elements of the story as given in 
the directions was omitted, However, 
during any single story no subject was 
prompted more than once for any par- 
ticular element. After the first ten cards 
were administered, the subjects were told 
that the test was half over and that there 
would be a ten-minute break during 
which they could go to the lavatory, have 
a drink of water, etc. After approxi- 
mately ten minutes the test was resumed 
with the following instructions: 


4 LEONARD D. ERON 


The stories which you told in the first half 
were all very good—just the kind of story I 
wanted. But the pictures I'm going to show you 
now are even more difficult to make up stories 
about. They are more vague and ambiguous, not 
as clearly defined, and you're really going to 
have to use your imagination to make up some 
meaningful stories. So give free rein to your 
imagination now and make up some good stories 
about these pictures. 

For the sixteenth card, which is blank, 
there were the special directions as 
follows: 


"Now look at this blank card. Imagine that 
you see some kind of picture there. Tell me 
what it is." After the subject described the pic- 
ture he was then told, "Now make up a story 
about it." 


C. TECHNIQUE OF ANALYZING STORIES 


After the stories were all collected 
they were analyzed for the following 
variables: themes, identification of char- 
acters, perceptual distortions, unusual 
details, level of interpretation, emotional 
tone of the body of the story and out- 
come, as follows: 

(1) The thematic analysis followed a 
check list of themes empirically derived 
in the following manner. After 50 proto- 
cols had been collected they were perused 
at least twice. The first time, each story 
was read through and the action taking 
place was summarized in a few words, 
After this was done for all the stories, 
the author, with the help of another 
psychologist examined the summaries 
and a check list of 98 themes was or- 
ganized under the two general headings 
of disequilibrium and equilibrium, indi- 
cating the state of tension or adjustment 
displayed in the story. These two gen- 
eral groups were subdivided into inter- 
personal, intrapersonal, and impersonal 
classifications, depending upon the 
sphere to which the situation was re- 
ferred. Тһе interpersonal classification 


* Dr. Ann Margaret. 


was furthef broken down into sections 
dealing with parent; partner, peer, and 
sibling. Each theme appearing in the 
classification was named and defined and 
examples of each given, so that the same 
list could be used in further rescarch. 
On the basis of additional experience the 
list was revised and expanded to include 
125 themes. The final check list as used 
in this study, with definitions of each 
theme, appears in Appendix B. During 
the examination of the stories, each 
theme was tallied as it appeared, so that 
most stories have more than one theme 
and rarely are there stories with no 
themes which appear on the check list 
and which are thus impossible to classify. 
Only the manifest content was con- 
sidered—the actual behavior of the char- 
acters in the stories as narrated by the 
subject, regardless of its covert signifi- 
cance. The thematic analysis was done 
by the experimenter, since it is actually 
merely a counting procedure with а 
minimal judgmental function—a theme 
is either present or not, and the sole cri- 
terion is the verbal statement of the sub- 
ject. 

(2) Identification of characters i 
cluded the sex and age of each of the 
individuals depicted on the cards, thé 
relationship when there were two OF 
more characters, and specific identifying 
features, including in some cases, nation- 
ality, occupation, physical characteristics; 
efe. In determining perceptual distor- 
tons, the standard taken was the de- 
scription of each picture offered’ by the 
authors of the test (41). Any deviations 
from this standard were noted. Notice 
was also taken of attention to unusual 

? Parent substitutes such as older authority 
figures are included under the heading © 
parent.” Authority figures of age range similar 


to hero are classified with “peer” or "partner, 
whichever is appropriate. 
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details such as pregnancy?of the woman 
on card 2, lack of tie on man on card 4, 
etc. Only one judge, the experimenter, 
was used for identifications of characters, 
as this too is merely a counting pro- 
cedure, 

(3) The level of interpretation refers 
to whether the responses were narratives, 
mere descriptions, symbolic, abstract, etc. 
and also to unusual formal character- 
istics of the stories such as inclusion of 
the examiner in the story, no connection 
between picture and story told, etc. A 
list of these characteristics and their def- 
initions appears in the latter part of 
Appendix B. Since the judgmental func- 
tion is minimal here also only one judge 
was used. 

(4) To rate the emotional tone of the 
stories and outcomes, it was decided to 
improve upon the three-point scale used 
in a previous study (19). Consequently 
a five-point scale was developed by the 
author and two collaborators (18). The 
scales which appear in Appendix А were 
derived in the following manner: 500 
TAT stories were read independently by 
three judges who indicated the rating of 
each story on a 10 cm. line representing 
a continuum from happy to sad. Upon 
completion of the ratings the lines were 
divided into 10 one cm. intervals and 
an average Pearson product-moment cor- 
relation coefficient of .76 was obtained 
among the raters. The 10 cm. lines were 
then divided into five 2 cm. intervals 
and each interval was assigned a weight 
as follows, ranging from sad to happy: 
—92, —1, o, +1, +2. Each interval was 
described on the basis of those stories 
for which there was complete agreement 
among the three judges, ie, all three 
ratings fell in that particular interval. 

Individual scales were constructed for 
each story for Emotional Tone and 


Outcome and general scales for the two 
variables were also devised. The indi- 
vidual scales and the general scale for 
Outcome were so similar that only the 
general scale was used in rating the out- 
come, although the individual scales 
were used for emotional tone.* The inter- 
interpreter reliability of this scale, based 
on ratings of three judges for 200 addi- 
tional stories (10 complete protocols) was 
rather high. The average Pearson т for 
emotional tone was .86; and for outcome, 
75 (18). It was felt that, since the rating 
of tone and outcome was a judgmental 
process, someone other than the investi- 
gator should rate the stories as the latter 
might be biased by a knowledge of the 
diagnostic classification of the subjects. 
Therefore independent ratings of the 
emotional tone and outcome were made 
by an individual who had no clue as to 
which were the normal stories and which 
belonged to members of the various 
clinical groups.* 


D. STATISTICAL METHOD 
All the data collected in this study 
(e.g, judgments of emotional tone, 
themes, perceptual distortions, etc.) were 
all of the frequency type and our com- 
parisons, therefore, employ the chi square 
technique. The formula was one with 
correction for continuity suggested for 

use when p = q = 0.5 (14). 
({m—s]—1)* 


Chi square = 


m+ 5 
In general each variable was treated in 
two ways: 
(a) First the data were analyzed on the 
basis of total responses, i.e., each group 


4 Individual scales for emotional tone of stories 
and the general scale for outcome appear in 
Appendix A. : 

3 Mrs. Dorothy T. Hake rated all the stories 
for emotional tone and outcome, 


6 LEONARD D. ERON 


Was compared with every other group 
on the basis of total number of times 
a given characteristic was present in 
those groups for all 20 stories. This means 
that each individual contributes more or 
fewer responses to the analysis. 

(b) Next the data were analyzed on 
the basis of individuals, that is the num- 
ber of persons in each group contribut- 
ing a given number or more of responses 
of each type. This provides information 
concerning whether the factor causing 
group differences was a greater mean 
number of responses per individual in 
the group or just a few extreme individ- 
uals. Also if there are no significant dif- 
ferences between groups, the reason 
might be, not that all individuals in 
both groups contribute approximately 
the same number of responses, but that 
in the one, occasional subjects are re- 
sponsible for the majority of responses, 


while in the other all are contr; 


ibuting 
just a few. 


In other words we would 
have some indication of the homogen- 


eity of variarice within the groups. Pre- 
sumably, also, if one group contributed 
significantly more of certain responses 
than another, and also contained sig- 
nificantly more individuals who made 
Such responses, we could be more cer- 
tain that these differences in responses 
represent real group characteristics, One 
of the inadequacies of past research on 
diagnostic testing is that it has been 
based on responses and not on individ- 
uals. This means that any generalization 
of the conclusions from the groups on 
whom the work was done to other groups 
is unjustified since intra-group varia- 
bility has not been considered. 

Chi square tests of homogeneity (20, 
33) were also performed in the analysis 
of individual themes and ratings of tone 
and outcome. The .os level of confi- 
dence (55) was adopted as the criterion 
of significance and all differences sig- 
nificant at this level were noted, both in 


the individual chi square tests and in the 
tests of homogeneity. 


س ر 


III.SRESULTS AND DISCUSSION 


А. EMOTIONAL TONE OF STORIES 


Г 15 obvious from Table 2 that the 
preponderance of stories told by all 
Subjects in response to the TAT are sad, 
although there is a little significant inter- 
group variation with respect to the pro- 
portion of stories in each category of 
the scale (chi square of homogeneity, 
94-9900, with 20 degrees of freedom, sig- 
nificant beyond the .oo1 level). A com- 
Parison of each group of subjects with 
every other group indicates that the six 
groups arrange themselves into two dis- 
tinct populations depending on whether 
or not they are hospitalized. The hos- 
pitalized groups appear to be emotion- 
ally duller than the non-hospitalized 
ones since those in the hospital tell more 
of the neutral stories and less of the 
extremely emotional ones than those not 
in the hospital. 


These differences might be a function of the 
seriousness of the illness since undoubtedly those 
in the hospital are more seriously ill than the 
Out-patient group. However, the relationship is 
not always a progressive one, with the non-hos- 
Pitalized psychoneurotics more like the hospital 
Broups than are the normal subjects, and the 
Psychoneurotics more like the normals than the 
Schizophrenics. There are some instances in 


which the  non-hospitalized — psychoneurotics 
differ significantly from the hospitalized ones 
and not from the schizophrenics (e.g., total num- 
ber of sad stories). If degree of illness caused the 
increasingly neutral stories it would be expected 
that the schizophrenics would have more neutral 
stories than the psychoneurotics and that the 
hospitalized psychoneurotics would be more like 
the non-hospitalized ones than the schizophren- 
ics. This is not the case and thus it seems that 
the differences obtained are a function of the 
hospitalization and not the degree of illness. 
An analysis of the data according to the number 
of individuals contributing a given number or 
more of stories of each category of emotional 
tone produces the same result. The hospitalized 
subjects, regardless of classification, give more 
stories which are neutral or flat in the amount 
of feeling invested in them. 


This is the same type of phenomenon 
that has been noted in individuals ex- 
pesed to other kinds of restricted, 
routinized, and deprived environments 
such as concentration camps, prisons, 
and orphanages (2, 9, 10, 21, 22, 23, 42, 


"Of course it is understood that the rough 
diagnostic categories here used are not the best 
indicators of severity of illness and that to 
study definitively the effect of hospitalization, 
duration of stay in the hospital would have to 
be strictly controlled and systematically varied. 
However, within the limits of this study it is 
highly suggestive that our obtained differences 
are the result of hospitalization and not severity 
of illness. 


TABLE 2 
FREQUENCY OF STORIES OF DIFFERENT EMOTIONAL TONE, By GROUP 
š Group йт. ag * 
Pu Ст C2 NHPN HPN HS GNPH Total Parison square 
I 13 13 6 8o 
Fic vi 6 Pi 52 55 54 359 —1, о 186.5721 .оот 
o 8$ 97 77 142 159 171 731 SS M e s 
—1 249 254 246 201 205 201 1356 وك‎ pu 479220 63 
—2 74 73 85 79 бо 55 426 
T I 13.6 12 14.6 
юш me Ho NT Wo H^ o di esed nece oor 
Total 66.2 56 5 51.2 50-4 
т Bs, ° Fu 221 280 265 256 1782 


* Upper figures are percentages. Lower figures are frequencies. 
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51). Reports of observations on such 
populations all stress the dulling of social 
adaptation, the apathy, indifference, and 
aimlessness which such controlled envi- 
ronments engender among their inhabi- 
tants. Greenson (23) discusses the cases 
of men who had been exposed during 
the war to adverse conditions, often free 
of actual danger, but with poor food, in 
bad climate, or with severe boredom and 
great loneliness. The apathy which de- 
velops in such individuals, he believes, 
is specifically caused by deprivation, 
which for Greenson refers to “lack or 
withdrawal of the necessary supplies of 
love (affection, deference) or food, or 
both, which the individual requires from 
his environment.” Apathy is the defense 
against painful perceptions and serves 
the purpose of avoiding “overwhelming 
feelings of annihilation."? 

One possible explanation, thus, of the 
relatively flattened affect of the stories 
told by our hospitalized patients is that 
it is a reflection of the insulation they 

*It is interesting that the prewar histories in 
his group of patients showed that men from 
broken homes and orphan asylums formed a rel- 


atively high percentage of those who developed 
apathy. 


have built around themselves as а de- 
[ense against the lack of affective bonds 
which they experience in this circum- 
scribed environment. It is an effect not 
limited to just one kind of patient, e.g. 
schizophrenic, but one which applies to 
all our hospitalized subjects. 


B. OUTCOME or STORIES 


As can be seen from Table g, the out- 
comes are more variable for all groups 
of subjects than the emotional tone of 
the stories, which seems to be determined 
largely by the stimulus properties of the 
pictures themselves. The outcomes ap- 
pear to be more clearly a function of 
the individual's creativity. 

The most frequent outcome is that de- 
scribed as moderately happy, although 
the frequency in this category is not signi- 
ficantly greater than that in the neutral 
category. The fact that story outcomes 
tend to be happy does not agree with 
Rautman and Brower's findings with 
children (45), although it does with Cole- 
man's investigations with children (15), 
and with а former study done with adults 
(19). Rautman and Brower, however, 
worked with only 10 cards, three of which 


TABLE 3 
FREQUENCY OF STORIES OF DIFFERENT OUTCOMES, BY GROUPS 
Out- Group m l کڪ‎ =~ 
come 1 " Com- Chi- 
Cr Сз NHPN HPN HS  GNPH Total Parison square T 
+2 45 27 33 16 1 
33 7 8 s 
+r 174 158 143 148 — 143 iis «m 
ACH Ch Cw RO DO ч бы o p? degens ош 
5 as , r ` 54 Р 
—2 56 67 76 4 E Cw deeem E 
? 24 29 69 105 7ї 76 Em 
Total 42.4 37.7 35.2 
37. 32.8 f кос 
happy? 219 185 176 164 кз А ы ee? 
ioi 27.4 2026 26.8 16.6 17.8 15.2 
sad* 137 13 13 $ j ca 
4 83 89 76 657;  happy,sad 101.2462 .oor 


^ Upper figures are percentages 


‚ lower figures are frequencies, 
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are not in this series, and®administered 
the test in group fashion. Thus their re- 
sults are not strictly comparable to those 
of the present study. However, they do 
mention the fact that happy endings 
were more common among teachers who 
took the test than with the children, but 
give no indication of the extent of the 
difference. There might thus be a dif- 
ference between children and adults on 
this variable. 

There is also much more intergroup 
variability in relation to outcome than 
was found in a consideration of emo- 
tional tone (chi square of homogeneity 
= 184.61 with 25 degrees of freedom, 
Significant far beyond the .001 level of 
confidence). The differences between the 
hospitalized and non-hospitalized groups 
are not as clearly defined as they were 
in the previous analysis, although the 
hospitalized groups give more neutral 
and less extreme outcomes than do the 
non-hospitalized | groups. In addition, 
cach non-hospitalized group gives signifi- 
cantly morc sad outcomes than each hos- 
pital group. When the data are analyzed 
by the number of individuals in each 
group contributing à certain number or 
more of a given outcome, the pattern 1s 
er individuals in the hos- 


the same—few 
give sad outcomes and 


pitalized groups 
fewer also give extreme outcomes. Sig- 
nificantly more non-hospitalized psycho- 
neurotics give a large number of mod- 
erately sad outcomes than do hospitalized 
psychoneurotics and, as in the analysis 
of emotional tone, it would thus seem 
that hospitalization itself is a factor in 
making the outcomes more bland.* 
Both normal groups give significantly 
less questionable outcomes than each of 
the clinical groups. Of all the clinical 
Broups, the hospitalized psychoneurotics 


ag, 
See note on page 14- 


give the most such conclusions, signifi- 
cantly more than both the non-hospital- 
ized psychoneurotics and the schizophren- 
ics, Essentially the same picture is 
obtained when the data are analyzed by 
individuals. No normal subject gives 
more than four such outcomes, whereas 
in the clinical groups there are individ- 
uals contributing as many as 12. This. 
excess of questionable outcomes among 
all the clinical groups is perhaps а re- 
flection of the indecisiveness and uncer- 
tainty of these patients who are not as 
forthright and sure of themselves as are 
normal individuals. This is carried to an 
extreme by the hospitalized psychoneu- 


rotics. 

C. SHIFT IN TONE FROM BODY or 

š Srory TO OUTCOME 

The fact that many individuals turn 
unhappy stories into happy conclusions 
has been noted before (15, 19) and the 
frequency of this phenomenon for all 
our groups of subjects appears in 
Table 4. 

It has been stated that the shift to 
happier conclusions is characteristic of 
schizophrenic individuals (26). Our data 
would demonstrate it to be more repre- 
sentative of normal persons than it is of 
schizophrenics. Another writer has char- 
acterized it as a "refusal to accept conse- 
quences which (in children) may lay the 
foundations for later neurotic behavior" 
(45. P- 200). Actually our normal sub- 
jects also manilest this characteristic more 
frequently than do the psychoneurotics, 
significantly so in the case of the out- 
patient group. That individuals often 
tack illogically happy endings on to very 
sad stories is not surprising since, by the 
mere instructions to make up a story, 
we are inviting the subject to indulge 
in fantasy. And one of the characteristics 
of fantasy is that it disregards logic and 


ы. 
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reason (39). Furthermore, the individual 
is following a cultural pattern illustrated 
by our movies which, producers seem to 
feel, must always have happy endings in 
order to be successful financially and 
which thus reflect the demand of the 
moviegoers. 


D. PERCEPTUAL DISTORTIONS 
1. Misidentification of Sex 


Instances in which the subject mis- 
identifies the sex of characters in the pic- 
tures are frequent and there is no sig- 
nificant difference among the clinical and 
normal groups in the total number of 
such misidentifications (Table 5). Nor is 
there a significant difference in the num- 
ber of individuals in each group making 
them. There are certain pictures in 
which sexual misidentification is fre- 
quent; others where it is completely ab- 
sent, Some errors in sex identification are 
made by individuals in each group; 
others, only by individuals in the clinical 
groups; and one, only by a normal indi- 
vidual. 

Many clinicians interpret the fact that 
a subject misidentifies the sex of a char- 
acter in the picture as an indication of 
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his own sex confusion. Numerous writ- 
ers (7, 26, 44, 46, 52) have specifically 
indicated that sexual misidentification of 
the character on Card 3BM is an indica- 
tion of a high feminine component in 
the personality of the male subject. How- 
ever, more than half the subjects are con- 
fused over the sex of this character and. 
this corroborates an earlier investigation 
(16). It is difficult to see how this can 
legitimately be called a projection of the 
subject’s feminine identification when 
fewer individuals identify it correctly 
than err in identification. This too would 
seem to be more a function of the 
stimulus properties of the card. There 
may be some justification in such a state- 
ment about the other pictures, where 
misidentification is rare, and where it 
can be assumed there is actual distortion 
on the part of the subject when he mis- 
identifies a character. It should be noted, 
however, that among our subjects are 7 
homosexuals, ranging from one diagnosed 
as latent homosexual to one who has 
been a transvestite for a number of years, 
and in no one of these cases is there a 
single instance of sexual misidentifica- 
tion. 


TABLE 5 
FREQUENCY OF INSTANCES OF SEX CONFUSION* 

c Group 

ard 

Cr C2 NHPN HPN HS GNPH ^ Total 

3BM 15 12 19 12 12 15 85 

5 т т 

6BM I 1 

8BM £ I І 1 І 2 7 
10 I 4 т 2 8 
12BM 3 2 4 3 2 6 20 
14 т 1 2 
I5 I 2 т 2 6 
20 6 4 4 3 5 5 27 
Total 27 20 зо 23 24 33 157 
Percentage 5.4 a 6 4.6 4.8 6.6 5.0 


. * This includes outright misid 
18 male or female. For exact па 
Pendix С. 


entification and instances in which subject is unsure whether character 
ture of these confusions and all perceptual distortions, refer to Ap- 


ther Perceptual Distortions 


Although’ the most frequent perceptual 
distortions are the sexual misidentifica- 
tions, Table 6 shows that other distor- 
tions are also not infrequent and occur 
in each group. By “other perceptual dis- 
tortions” are meant any departures from 
the standard description of the picture 
as described in the test manual (41). 
These include such things as identifying 
the violin in card 1 as a typewriter, the 
gun in card 3 as a pipe, the woman in 
card 4 as a negress Again there arc 
some cards for which distortions are fre- 
quent in all groups and other cards in 
which perceptual distortions are made 
only by members of the clinical groups. 
The general hospital group produces sig- 
nificantly more distortions than all other 
groups. But there are no significant dif- 
ferences among any of the groups as to 
number of individuals producing these 


“For instances of specific distortions see Ap- 
pendix C. 
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distortions. From 70% (Psychoneurotic 
in-patients) to 100% (General hospital 
group) of the individuals in each group 
produce at least one distortion and the 
schizophrenics produce significantly no 
more than do the normals or psychoneu- 
rotics. 

A common characteristic ascribed to 
TAT productions of schizophrenic pa- 
tients has been frequent distortion of 
objective characteristics of the pictures 
(4, 44» 46). However, these data show 
that perceptual distortions are no more 
representative of our group of schizo- 
phrenics than they are of any of our other 
subjects, normal or ill. This is evidence 
of Cameron's "principle of continuity" 
(13). Distortions of reality are common 
among normal individuals as well as 
among persons with behavior disorders. 
The only difference, important as it may 
be, is that normal people are better able 
to check up on their faulty perceptions 
through appropriate and practiced tech- 


TABLE 6 


FREQUENCY ОЕ OTHER PERCEPTUAL Distortions 


Groups Significant Intergroup 
Card Differences 
Cx C2 NHPN HPN Tot: ; 
М HS GNPH Total Comparisons Chi-square Р 
I I 3 I I 6 
3BM 7 13 5 5 1 14 re 
4 6 2 I 6 3 21 
6BM 3 А а 
;BM 1 1 2 
8ВМ 1а 8 9 
BM 3 6 2 3 A Е ee 
12BM 3 I $ 4 4 6 23 
13MF I 3 $ 6 28 
14 2 2 5 2 
15 3 I 5 ч 
1;BM І 2 3 ó 
18BM 6 3 2 5 5 21 
Tot 1 8 
ota 39 34 31 31 8 << I 
Percentage 7.8 6.8 6.2 6.2 18 DM 555 Ci-GNPH 7.2900 .01 


-5 C2-GNPH 
HPN-CNPH 
NHPN-GNPH 
HS-GNPH 


10.0104 .005 
10.7895 .005 
10.0104 .005 


* No significant differences when analyz 


ed by individuals, 


4.8090 .03" 
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niques of social validatiow. The critical 
point in the development of delusions, 
C.g., is, in addition to the need and the 
“reaction sensitivity,” the failure of cor- 
rection, 

It is difficult to see, on the basis of our 
data, how perceptual inaccuracies can 
be used as a diagnostic "cue" for schizo- 
phrenia. Of course, the schizophrenics in 
our sample are all young, just recently 
diagnosed, and with comparatively little 
deterioration. However, these are the 
cases which are the diagnostic problems. 
The TAT is not necessary in order to 
ascertain that a deteriorated schizo- 
phrenic has bizarre ideas or distorts re- 
ашу. But when the TAT is used as a 
diagnostic instrument with suspected 
cases of schizophrenia it is unjustifiable 
to employ presence or absence of per- 
ceptual distortions as a "sign." 


E. THEMES 


In the approximately 3,000 stories 
which were analyzed in this study there 
were 6,210 discernible themes which 
could be classified according to our check 
list. These were contributed by the vari- 
ous groups as indicated in Table 7. All 
the groups, except (GNPH), are about 
equally productive, with no significant 
difference in the number of themes pro- 
duced. The last group contributes sig- 
nificantly less than each of the other 


TABLE 7 


NUMBER or THEMES CONTRIBUTED BY 
SUBJECTS IN EACH GROUP 


Average No. 


Group No. of Themes per Individual 
Ci 1,073 43 
C; 1,088 44 
NHPN 1,067 43 
HPN 1,017 41 
HS 1,034 4 
GNPH got 38 


groups, except (HPN), and the differ- 
ence, cven in this comparison (chi square 
= 3.7809), approaches significance at our 
criterion. It is obvious that lack of pro- 
ductivity is not a characteristic of any 
of our more homogeneous groups (in- 
deed, in average number of responses 
per individual there is little difference 
among any of the groups). Schizophrenics 
as a clas do not show an unusual 
amount of blocking as has been re- 
ported (4, 8). 

As for the themes themselves, the бо 
most popular ones, including all those 
appearing at least 30 times (or in ap- 
proximately 1% of the stories), are listed 
in Table 8. The inspection of this table 
makes it apparent that themes of viol- 
ence, hostility, death, restriction, guilt, 
and frustration are common to subjects 
of all types in response to the TAT and 
are not peculiar to any particular clinical 
group as has been reported in the litera- 
ture (5, 8, 25, 30, 31, 82, 35, 44, 47, 50). 
Themes for which these subjects do not 
comprise a homogeneous population ap- 
pear in Table 9. There are 11 such in- 
dividual themes in our classification 
scheme of 125 themes, which is more 
than could be expected by chance at the 
.o5 confidence level. However, when in- 
dividual comparisons between groups are 
considered, only 72 significant differences, 
listed in Table 10, occur out of a total of 
1,875 comparisons (125 themes X 15 com- 
parisons per theme). This is less than 
would be expected by chance 5% of the 
time. It should be noted that significance 
could not be calculated by the chi square 
procedure in 321 comparisons because 
there are 45 themes for which one or 
more groups have a zero frequency. But 


* By unusual is meant a difference in fre- 
quency which is statistically significant at the 
.05 level of confidence. 
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T 0: 3 ; 150 3,0 STORIES? 
F E TRIBUTED BY 'UBJECTS IN 3,000 E 

vr THEMES CONTRI so 5 TAI 

Мозт FREQUENT THEMES 


Group 
Theme 1 
- Cx Co NHPN HPN HS GNPH Tota 
Name 
No. 289 
= í parents oe б 25 ES sE 5 257 
IAra Pressure from pa alsoe 1: 49 47 А 264 
8 ICs Aggression from impersonal sou 18 54 45 40 «n da 207 
3 IBr Aspiration 34 и 32 20 So er 186 
Curiosity ` 2 32 33 68 
: ТАЗЬ Succorance from parents a K A dd 28 17 а 
Š IB4 Penavion disorder 27 31 26 25 28 29 262 
Vacillation 2 39 33 
ИЛ бао m 2 $2 = 5 & 
Occupational dissatisfa 
т Bi Physcalillness or death of central 2 M ei um 24 31 m 
S character 29 26 21 27 27 17 146 
тт ICr Economic pressure 29 31 28 24 12 а a 146 
War 3 : 24 32 
15 1186 Occupational satisfaction n 2 7 ad 28 RU 143 
i4 IB;  Guilt—remorse S 399 ах g y a 
15 IA3b  Succorance from peer 25 26 22 27 19 17 xt 
16 IB8 ‘Drunkenness 23 26 25 19 17 22 e 
17 IA2a Pressure from partner i 18 i 25 30 22 ү 
Religion 2 2 22 І 
е 1816 АЕР аи towards environment 2 = ся 23 is i3 108 
20 IB2 dhadequacy 235 — 29 4 ns 14 1 ps 
h icit sex X rj 2 1 14 1 
A ne Death or illness of parent 16 = = D 10 15 2 
23 ПАгс Contentment of partner 3 > xb ү 5 7 88 
24 ТАзј  Belongingness 6 її то 29 18 14 8 
25 IC2 Legal restriction £ 2. 15 25 7 18 тї 4 
поразу Ds, сена тї 14 15 15 20 8 Ei 
оп to 8 
27 lose Eid ehem perilous environment 14 r 12 d zs 29 7 
29 IBr3 Retribution 2 E 2: is 18 9 2 
3o IBs Suicide 1s 0 9 i 8 12 z 
31 IBi; Loneliness 3 15 15 7 13 8 z 
ear ae 
33 Ha Intrapersonal tranquility A > E 2 E Е бт 
m Hypnotism © бо 
+ ТАЗ Death or illness of child Š 13 9 3 ae Й 50 
36 TA3d пешеп from peer : 9 "T 2 = 14 58 
Exhaustion 
33 IA Death or illness of peer II 8 10 8 10 то 2 
š IBro  Rumination 15 18 6 5 7 ss 
d IA2f Departure from partner 8 15 8 7 8 9 23 
ООСС Hh parent то à 8 4 о Š s 
2 IA3a Pressure from peer | II 12 6 9 5 9 31 
4 1C3 Generalized restriction 17 14 7 5 4 4 [2 
42 IIBa Reminiscence, happy 10 7 то т2 4 8 z 
45 IA2B  Succorance from partner 6 7 6 її 8 7 S 
46 IAze Aggression towards partner 4 8 9 12 8 2 Аз 
47 IIBr Self esteem — | м 9 то 8 6 3 7 di 
48 IIB; Ordinary. activity intra-personal 5 7 4 5 7 14 А 
49 IAim Filial obligation 8 8 8 3 6 8 8 
so IA2n Partner competition 3 4 8 7 7 9 38 
51 ПС Favorable environment 9 8 s 8 7 I š 6 
52 HA3e Exhibition ` 8 7 6 3 4 8 a 
53 IA:p News of marriage to Parent 5 її 2 6 5 6 2 
54 IBr4 Reminiscence, sad 6 6 6 3 8 5 3 
55 IArg Parental concern 5 3 5 7 1 6 33 
56 IA2c Nurturance to partner 7 6 8 A 3 5 33 
57 IA3x Competition with peers то 6 4 2 5 6 33 
58 IB6 Moral struggle 2 6 9 4 4 5 30 
59 IAzm Jealousy of partner 5 10 3 6 3 3 3° 
бо IArk Disappointment to parent 7 5 2 7 A $ 3° 
* Included are all themes appearing in 1% 
order of decreasing frequi 


(зо) or more of the 3,000 stories. "Themes are listed in 
ency. 


- For definitions of specific themes see Appendix B, 
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° TABLE 9 


THEMES ror WHICH ALL GROUPS or SUBJECTS Do Nor МАКЕ UP A HOMOGENEOUS 
PoPULATION AT .05 CONFIDENCE LEVEL 


Theme Chi-square Р 
: (s degrees 
No. Name of homogeneity of freedom) 
IA 12 Pressure from Parent 19.2290 .or 
IAij Death or Illness of Child 13.2000 К 
Are Nurturance to Parent ж 16.2500 .OI 
"Total Themes Parental Disequilibrium 14.9178 .02 
IA2h Illicit Sex 21.5204 .оот 
IAsj Death or Illness of Partner — , 12.2221 05 
I Total Themes Partner Disequilibrium 11.4887 105 
Аз} Belongingness аф. 15.3335 .o1 
Total Themes Interpersonal Disequilibrium 17.0505 ТӨТ 
1Вто Rumination h 27.1578 .оот 
IBr: Occupational Dissatisfaction 12.7306 .os 
ІС2 Legal Restriction E 21.8668 .оо1 
Total Themes of Disequilibrium 21.5186 .оот 
IC3 Generalized Restriction of Environment 20.3203 ТӨТ 
Ard Reunion with Parents 12.0549 .os 
Total No. of Themes 16.3400 .OI 


even on the basis of the remaining 1,554 
comparisons, there are fewer significant 
differences than could be expected by 
chance. Thus it would seem that, at least 
on the basis here proposed, content analy- 
sis cannot differentiate diagnostic classi- 
fications. 

In order to say that any given feature 
characterizes a certain clinical group, this 
feature must distinguish that group both 
from normal individuals and from other 
clinical groups. Not one of our classes of 
subjects differs significantly from all other 
classes in any given theme. Furthermore, 
no class differs from both normal groups 
in more than five themes, and there are 
no themes for which the non-hospitalized 
psychoneurotics differ from both normal 
groups. In fact there are fewer significant 
differences between one normal group 
(C1) and one clinical group (NHPN) 
than there are between the two normal 
groups (Ci) and (C2). By chance there 
Should be more than six such differences 
in 125 comparisons. There are only five 
Pairs of groups that have more than six 
Significant differences between them and 


thus differ more than could be expected 
by chance in regard to themes. These are 
(C1) and (HPN) with 10 significant dif- 
ferences, (C1), and (GNPH) with seven 
significant differences, (C1) and (HS) 
with eight significant differences, (C2) 
and (HPN) with seven significant differ- 
ences, and (C2) and (GNPH) with eight 
significant differences. 

As was mentioned above, there are 45 
themes in which the frequency for at 
least one group is zero and therefore 321 
comparisons could not be made. The 
maximum number of such instances for 
any particular comparison, however, is 
28, and when the total number of com- 
parisons between any two groups is re- 
duced by the number of times that the 
chi square could not be calculated, the 
number of differences more frequent than ` 
could be obtained by chance between 
any two classes is not greatly affected. 
The comparison between another pair 
of groups, (HPN) and (GNPH) can now 
be added to the above list. It would seem 
therefore, that we could say definitely no 
more than that hospitalized psychoneu- 
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TABLE ro, o 
DIFFERENCES IN THEMES AND INTERPRETATION LEVELS SIGNIFICANT AT OR BEYOND .05 
CONFIDENCE LEVEL 
uw 1 O, a 2 
No. Name q Ë ә 
а from Parent C2 65 HPN 3r 1.3437 .оо1” 
ТА та Pressure fr e 6 GNPH 34 жо puer 
HS 58 НРМ 3: 7.5955 .or 
HS 58 GNPH 34 5.7500 .02 
Cr 51 HPN 31 4.4024 .035 
NHPN so НРМ 3r 4.0000 .05 
IArc Nurturance to Parent (Gr 9" HPN I 4.9000 103 
Cr 9^ HS 1 4.9000 E 
C2 8 HPN I 4.0000 105 
C2 8% HS I 4.0000 «o8 
ТА Departure from Parent NHPN 2o HPN 7 5.3333 025 
1А Death or Illness of Parent Cr . HE Са 9 4.6451 035 
NHPN 22 Сг 9 4.6451 035 
IA:j Death or Illness of Child сы 18 НРХ 3 9.3333 .005” 
š C2 13 HPN 3 5.0625 +025 
Ci 18 GNPH 7 4.0000 +05 
IMp Marriage of Child C2 1 NHPN 2 4.9231 .03 
IA2e Aggression to Partner HPN 12 GNPH 2 5.7857 .02 
IA2h Illicit Sex C2 29 7 2.2500 oot” 
Cr 25 7 9.0312 ‚оозу 
€ 20 14 4.5581 .035” 
c 29 14 4.5581 .035 
¿ 29 15 3.8409 +05 
TA2j Death or Illness of Partner HPN 39 Cr 18 7.0175 ‚өт 
HPN 39 GNPH 2o лате .02 
HS 33 1 18 3.8431 .os^ 
IA3b Succorance from Peer GNPH 32 HPN 14 6.2826 015 
ТАзе Aggression to Peer HS 20 GNPH 8 T€— am 
ТАза Aggression from Peer NHPN ту Ci 6 à 6 
NHPN :? бурн ¢ PU o 
IAsj Belongingness C2 23 GNPH +» » Bi 
С = GNPH 7 0.5950 005 
Ec 23 HS @ 5.2812 .025 
* 22 HS 9 4.6451 .o3s" 
IA3i Competition Сї 164 HPN " "m баё 
IB: Aspiration C2 54 GNPH 32 doris m— 
IBs Curiosity ‚ЧЕН 42 HPN 20 7.1129 .o1 
HS 41 HPN 20 6.5574 .015 
РР = a e 38 HPN 20 4.9828 103 


* No individual contributes more th. 
b Also a difference in number of ind 


an one such r 
ividu: 


t esponse. 
als in each group. 


(Continued on next page) 
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2 2 
‘theme ‚ Group Group Chi-square p 
No. Name frequency frequency 
IB4 Behavior Disorder Ct 33 GNPH 17 4.5000 -035 
NHPN 33 GNPH 17 4.5000 .035 
185 Suicide HS 18 Cr 4 7.6818 .о1 
HPN IS Ci 4 5.2632 .o2sb 
IB; Guilt—Remorse HPN 34 NHPN 18 4.3269 o4 
HPN 34 HS 18 4.3269 o4 
\ IBro Rumination C2 18 HEN — 3 6.2609 5015 
C2 18 NHPN 0 5.0417 .025 
С2 18 HS 6 5.0417 .025 
Cr 15 HPN 5 4.0500 .045 
С2 18 СМРН 7 4.0000 +05 
IBir Occupational Dissatisfaction Gt 41 C2 19 7.3500 .o1 
Сї 41 GNPH 20 6.5574 .015 
Сї 41 HPN 22 5.1428 .02 
IB21 Exhaustion GNPH 17 C2 6 4.3478 -04 
IC2 Legal Restriction HPN 2 Cı 6 13.8286 .oo1” 
m ? HPN 29 NHPN 10 8.3077 1005 
HPN 20 C2 її 7.2230 .o1 
HPN 29 GNPH 14 5.9394 1015 
| HS 18 Сї 6 5.0410 025 
| IC3 Generalized Restriction Cr 17 GNPH 4 6.8570 „OI 
Ст 17 HS 4 6.8570 .о1ъ 
Cr 17 HPN 5 5.5000 .02 
4 C2 14 GNPH 4 4.5000 .035 
C2 14 HS 4 4.5000 035 
IC6 War С2 3ї HS 12 7.5348 .O1 
4 Cı 29 HS 12 6.2430 O15 
NHPN 28 HS 12 5.6250 .o2 
HB4 Retirement HS 32 cr 16 4.6875 +035 
HB7 Ordinary Activity GNPH 14 NHPN 4 359099. 935 
avora Environment Ст 9 GNPH 1 4.9000 103 
IC: Favorable En e 3 CNPH 1 rea A 
VÉ HPN 8 GNPH 1 4.0000 .05 
y LEVEL ОЕ INTERPRETATION 
I ы n Сї 22 NHPN 6 8.0357 Š 
HA Symbolic Сз 58 NHPN 6 pct pe 
Cr 22 HPN 8 5.6333 .о2 
С2 22 HPN 8 5.6333 .02 
HS 18 NHPN 6 5.0417 .025 
-— GNPH 21: C2 3 12.0417 d 
HI € Description GNPH 21 Gr 4 "ped [o 
GNPH 21 NHPN 5 8.6538 .005 
HPN 15 Ca 3 6.7222 .o15 
HS 14 C2 3 5.8823 .02 
HPN Is Ст 4 5.2631 .o2s^ 
HS 14 C1 4 4.5000 .03 
HPN 1:5 NHPN s 5.0500 025 
Сї 19 GNPH 6 5.7600 .02 


HID Unreal 


(Continued on next page) 
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Theme d 
Group Group Chi › 
А hi-square р 
Ме. Nome frequency frequency 
шс Autobiographical HPN 2s Ci 12 3.8919 05 
HII Alternate Themes HPN 45 Cr II 19.4464 -oo1 
NHPN 35 Cr її 11.5000 .oo1h 
GNPH 31 Gs II 8.5952 .005 
HS 3o Cr тї 5.5609 .o2b 
HPN 45 C2 23 6.4853 .015 
Ш J Comments GNPH 20 NHPN 4 9.3750 .005 
GNPH 2o C2 6 6.5000 .015 
HS 16 NHPN 4 6.0500 .015 
GNPH 20 HPN 8 4.3214 .04 
HIL Rejection HS 12 C2 т 7.692 ә? 
GNPH 9 C2 I pum +03 
HS 12 NHPN 2 5.7857 .02 
HPN 22 2 т 17.3913 .oo1b 
НРМ 22 МНРМ 2 15.0417 .oor 
HI M Peculiar Verbalizations HS I3 HPN* т 8.6429 .005 
III N Confused HS 18 Ст ї 13.4737 +001 
HS 18 NHPN 1 13.4737 oor! 
GNPH rr Cr I 6.7500 .ої 
GMPH п NHPN 1 6.7500 or 


° All other groups except GNPH have zero frequency. GNPH h 


significantly different from 13. 
4 Groups C2 and HPN have zero frequency. 


rotics tell certain kinds of themes which 
are distinctive, since this is the only 
group that differs from both normal 
groups and from a miscellaneous neuro- 
psychiatric group more frequently than 
could be expected by chance. 

‚ When the data are analyzed by indi- 
viduals, the latter statement must be 
even more circumscribed because on this 
basis significant differences are still less 
frequent. Now there are significant dif- 
ferences merely between each of the nor- 
mal groups and each of the clinical 
groups; but the number of these dif- 
ferences ranges from only one to three— 
less than could be expected by chance. 
Thus it can justifiably be said only that 
this particular group of hospitalized psy- 
choneurotics differs significantly more 
frequently than could be expected by 
chance from these particular normal 


аз frequency four which is not 


groups and this particular general neu- 
Topsychiatric hospital group. 

It is possible to rearrange the subjects 
so that there are larger classes and thus 
the comparisons can be based on more 
individuals as in Table 11. When all 
psychoneurotics, Groups (NHPN) and 
(HPN), are compared with all normals, 
Groups (C1) and (Cz), more significant 
differences appear. There are now 11 
such differences, almost twice what could 
be expected by chance, And when, as 
in Table 12, all hospitalized groups; 
(HPN), (HS), and (GNPH) are compared 
with all non-hospitalized groups (C1); 
(C3), and (NHPN), slightly more differ- 
ences emerge. It may be that our original 
six classes are not large enough to permit 
significant group differences to emerge- 

Some of the differences which are ob- 
tained when the subjects are regrouped 
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TABLE її 
AT OR BEYOND .os CONFIDENCE LEVEL BETWEEN ALL 


0 


NORMAL AND ALL PSYCHONEUROTIC SuBJECTS 


Dos Normal Psycho- Chi- P 

No. Nama neurotic Square 
IA:a Parental Pressure 116 81 5.8680 .o2 
IA:c Nurturance to Parent , 17 4 6.8751 .OI 
ТАЦ Death or Illness of Child | 31 12 7.5344 К 

Total Parental Disequilibrium 356 290 6.5402 .02 
IA2h Illicit Sex 54 29 7.5300 ої 
IA2j Death or Illness of Partner дт 68 6.2018 poe 
IA3j Belongingness 45 27 4.0139 .05 
1Вто Rumination E р 33 II 10.0227 .O1 
IBrr Occupa tional Dissatisfaction 60 39 4.0404 .05 
IC2 Legal Restriction М 17 39 7.8751 .o1 
ICs Generalized Restriction of Environment 31 12 7.5349 “ol 
IIA:d Reunion with Parents 2 10 4.0833 .o5 

Level 

ША Symbolic 44 14 14.5000 .оот 
Ill C Descriptive 7 20 5.3333 105 
ШЕ Fairy Tale 8 I 4.0000 .05 
NEI Alternate Themes 34 8o 17.7631 001 
HIL z 24 19.3600 NT 


DIFFERENCES IN THEMES SiGNIFI 


Rejection 


TABLE :2 
CANT AT OR BEYOND .05 CONFIDENCE LEVEL BETWEEN 


HOSPITALIZED AND Nox-HOsPITALIZED SUBJECTS 


Non- 


Theme š M 
a Nu OU ڪڪ ص‎ ccc Hospital- нере Chi- P 
No. Name ized ше Square 
Тата Parental Pressure 166 123 6.1038 ^" .оз 
IA1c Nurturance to Parent 20 s 7.8400 TE 
1А Departure from Parent 48 26 5.9595 jm 
IA:j Death or Illness of Child , 40 20 6.0167 ye 
Total Parental Disequilibrium $21 423 9.9671 of 
IA2h Illicit Sex 68 36 9.2404 сог 
IAsj Belongingnes$ . .. . 61 28 11.5102 бот 
Total Peer Disequilibrium .. , 348 289 5.2810 los 
Total Interpersonal Disequilibrium 1290 1127 10.9913 eax 
IB2 ]nadequacy 57 88 6.2069 102 
ІВто Rumination 39 18 7.0175 ‚от 
1B16 Religion |, 49 17 5.7857 .02 
IC2 Legal Restriction |. N 27 бт 12.3750 abi 
IC3 Generalized Restriction of Environment 38 13 11.2941 ness 
ICó War 88 58 5.7603 pe 
Total Disequilibrium +804 2580 25 ла "OOT 
IIB2 Tranquilit: 2 5.142 9 
Total Themes 3228 2982 9-7449 = 
Level 
ША Symbolic 5o 28 4.7619 .o5 
HI C Descriptive 2 ЕЧ 32.2908 Em 
HI G Autobiographical H E 0.3018 .02 
HII Alternate Themes 9 8.6087 +02 
ш] Comments 29 44 05 .02 
ING Rejections 3 43 33.0652 .00ї 
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into these larger classifications are in- 
teresting. The normal subjects give sig- 
nificantly more themes of parental dis- 
equilibrium than do the psychoneurotics, 
and within this category, more of both 
pressure from parents and nurturance to 
parents. These normals apparently pro- 
ject their parental difficulties into their 
fantasy more frequently than do these 
psychoneurotics. However, theirs is per- 
haps a more mature level of fantasy Ьс- 
cause a greater number of themes of 
nurturance to the parent may be an ex- 
pression of independence, even while 
more themes of parental pressure reflects 
a chafing at. parental restrictions. 

That the normal subjects contribute 
more themes concerning illicit sex may 
be due to fewer inhibitions in giving 
verbal expression to these thoughts—or, 
if one accepts the concept of need pro- 
jection, perhaps they have a greater im- 
pulse to express them verbally since they 
are finding less expression actually. The 
fact that they do not contribute as many 
themes of death of partner possibly indi- 
cates less hostility towards the opposite 
sex, which might very well be a function 
of the fewer inhibitions just noted. The 
normal subjects offer more themes of be- 
longingness, perhaps because, as college 
students, they are more concerned with 
problems of acceptance and fellowship; 
and the traditional preoccupation of col- 
lege students with world problems and 
man’s insignificance may be reflected in 
the preponderance of themes of rumina- 
tion. Bennett (8) also found sexual top- 
ics and “disillusionment” (similar to our 
"rumination") more popular with nor- 
mals. 

Some of these same differences appear 
when hospitalized and nonhospitalized 
subjects are compared; and new ones of 
interest are also revealed. Those in the 


hospital give significantly fewer themes 
of interpersonal disequilibrium, includ- 
ing fewer themes of disturbance in the 
parental situation and fewer themes of 
illicit sex and belongingness. In general, 
they give fewer themes of disequilibrium, 
perhaps as a result of their simplified. 
protected environment which removes 
them from many everyday struggles. 
This is further illustrated by the fact 
that they have fewer themes of general- 
ized restriction of the environment, yet 
more of legal restriction, the latter prob- 
ably a reflection of their own position in 
the hospital. They also seem to be less 
concerned with war than the other vet- 
erans. The hospitalized subjects offer sig- 
nificantly more themes of inadequacy, 
perhaps because of their own inferiority 
feclings which are intensified by the hos- 
pital experience. More religious themes 
might be expected of more seriously ill 
neuropsychiatric patients, while fewer 
themes of rumination 
lessened conce 
cal problems, 

, Although there is no indication of the 
significance, because chi square tests 


could not be performed, it is important 
to note those inst 
of theme in one 


might indicate a 
rn with other philosophi- 


ances where frequency 
i group was zero. This 
might indicate certain characteristics 
which are exclusively clinical or exclu- 
sively normal, at least for this sample. 
We find that no Subject in any of the 
non-hospitalized groups offers а theme 
of homosexuality, while at least one such 
response is forthcoming from each of the 
hospitalized groups and five from the 
schizophrenic classification, However, in 
the latter &roup there are only two indi- 
viduals who contribute such themes, 
since one contributes four of the five 
themes, Nor does a single theme of incest 
with siblings appear among the normal 


ША... 


ү 
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subjects, although there gre two such 
schizophrenic responses and one in the 
non-hospitalized psychoneurotic group. 
However, one theme of incest with par- 
ents is contributed by a normal, and one 
by a hospitalized psychoneurotic, but 
none by a schizophrenic or any other 
clinical subject. It cannot be said then 
that tabooed sexual themes (incest, rape, 
homos duality, etc.) are exclusively schiz- 
ophrenic as has been stated in the litera- 
ture (4.4). 

It is interesting that although each 
st one response of dis- 


group has at lee 
regard for peer, there is no such response 
in the schizophrenic group. The same is 
true of the theme, self pity. In general 
there is only an occasional theme which 
is found exclusively in schizophrenic or 
exclusively in psychoneurotic or exclu- 
For in almost 


sively in normal groups. 
ification to which one group 


every cl: 
some members of at least 


contributes 
one other group also appear. 


we have hypothesized about the 
ntergroup differences, 
seems to be 


Although p 
significance of some of the ir 
the outstanding finding of this study 
the similarity in fantasy content among our 
various groups of subjects. This is not at vari- 
ance with what has been reported by some inves- 
ligators using other projective techniques. Ма 
than 25 years ago Gardner Murphy (37) reportec 
that he could not establish any clear cut differ- 
ences among the responses of demen a рне 
тапіс depressive, and parcetic patients to thé 
Kent-Rosanolf word list on the basis of pre- 
ponderance of опе type of association. further 
study (38) of dementia praecox and тапіс aes 
pressive patients and normal subjects, classi fy- 
ing their responses according to the logical eta: 
tionship between $ and R. also failed to mmr 
entiate among these groups. McDowell (34) also 
found no difference between stuttering and non- 
stuttering children on the basis of the word 
association test. 

Anastasi and Foley (3) fo 
lapping in the type of subject m portraye 
by abnormal and normal groups 1n an investlga- 
Чоп of their spontaneous drawings. T heir findings 
are very similar to ours They say, Most of the 
subject matter categories do not differentiate 


und extensive over- 
matter portrayed 
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significantly between normal and abnormal 
drawings. Although certain types of drawings 
appear to be more clearly diagnostic of ab- 
normality, such drawings are made by a small 
minority of institutionalized patients. The ma- 
jority of abnormal patients, although showing 
clearly psychotic behavior in other respects pro- 
duced drawings which were indistinguishable 
from the normal in subject matter" (3. p. 174). 

Recent studies have been carried out on popu- 
lations similar to ours using the Draw-A-Man 
technique. Royal (48) found no statistically sig- 
nificant differences between neurotic and normal 
veterans on a check list of 28 drawing character- 
istics, although there was a tendency towards 
significance in eight of them. Albee and Hamlin 
(1) found that 15 clinical psychologists could not 
distinguish the drawings of psychoneurotic and 
schizophrenic out-patient veterans from each 
other. 

Using the verbal summator technique, Grings 
(24) found no discrimination in projective con- 
tent among 24 schizophrenics, 15 manic depres 
sive, depressed patients and 18 psychoneurotics. 


„Гһе fact that there are no differences 
in the thematic content of individuals of 
the various nosological groups has im- 
plications for the theory of behavior pa- 
thology. It is further evidence for the 
hypothesis of continuity between normal 
and abnormal, According to J. F. Brown, 
"abnormal psychological phenomena are 
simply exaggerations (i.c., overdevelop- 
ments or underdevelopments) or dis- 
guised (ie. perverted) developments of 
normal psychological phenomena" (11, 
p. 7). White (53) has pointed out how 
maladjustments arise out of each stage 
or process of normal growth, and Cam- 
eron (13) contends that normals, neu- 
rotics, and psychotics differ from one 
another only in degree and that any be- 
havior found in psychiatric patients is 
somehow related to normal behavior. If 
we accept the notion that "fantasy is 
driven by need" (39, p. 118), it can be 
predicted from any of these points of 
view that there would be no difference in 
the content of fantasy productions of 
individuals of different diagnostic cate- 
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gories since all individuals have the same 
needs, whether they are psychotic, neu- 
rotic or normal. And these are indeed our 
findings. 

If any one thing differentiates our sub- 
jects it seems to be the factor of hos- 
pitalization, which has been discussed 
above in the consideration of emotional 
tone and outcome. The effect of the ex- 
ternal environment on the content of 
fantasy productions has been pointed out 
by some investigators. 


Bumke and Kant (12), for example, report that 
patients in continuous hydrotherapy hallucinate 
lobsters and fish. Potter (4g) found environmetal 
factors, in this case a hospital stress situation, 
affected the fantasy of women awaiting gyneco- 
logical surgery and obstetrics so that their 
Rorschach protocols were more similar to each 
other than to a matched group of clerical 
workers not in a hospital situation. This “level- 
ling" effect, which also makes our hospitaliztd 
patients, regardless of clinical classification, more 
similar to each other than non-hospitalized sub- 
jects, is further evidence of the influence of sit- 
uational factors, e.g., limitation of environment, 
on fantasy productions. Bondy (10) in a discus- 
sion of the effect of internment camps on per- 
sonality organization, attributes to the social iso- 
lation the eventual similarities in personality 
which emerge. “In consequence of this isolation 
in the internment camps, all social veneer and 
superficial personality traits soon disappear. 
With startling rapidity the internees reach a 
common denominator and lose their individual- 
ity” (10, p. 462). Bettelheim (9) found signifi- 
cant differences when comparing old and new 
prisoners and the differences “seemed to origi- 
nate in personality changes brought about by the 
impact of camp experiences on the prisoners” 
(9, p. 437). The unnatural, restricted, and rou- 
tinized environment to which our hospitalized 
patients are exposed may easily have a similar 
kind of effect on their fantasy productions. 


F. LEVEL OF INTERPRETATION 


Since the analysis of the TAT by con- 
tent has failed to discriminate between 
diagnostic classifications, and since it has 
been suggested by some (32, 54) that 
analysis of the formal aspects of the pro- 
ductions would be more fruitful, the 


writer, in the course of analyzing the 
stories for themes, noted also when there 
were deviations from the instructions to 
make up a narrative. Of 16 different 
kinds of deviation from the instructions? 
these groups do not make up a homo- 
geneous population in five (Table 13). 
The specific inter-group differences sig- 
nificant at or beyond the .os level of con- 
fidence are noted in the latter part of 


TABLE 13 


LEVELS OF INTERPRETATION FOR WHICH ALL 
GROUPS or SUBJECTS Do Not Make Ur 
A HOMOGENEOUS POPULATION 


L Chi-Square of * 
evel Homogeneity В 
ПІ A Symbolic 20.1 ог 
HI C Description 27. 2559 +001 
ПІ I Alternate Themes 22.61 52 .oot 
HI J Comments 19.3000 Or 
HI L Rejection 21.3750 .oo1 


* 5 degrees of freedom. 


Table 10; and there are 29 such differ- 
ences, which is more than could be ех- 
pected by chance. The hospitalized psy- 
choneurotics reject more cards than do 
all other subjects, although each of the 
hospitalized groups rejects significantly 
more than each of the non-hospitalized 
groups. There are only two rejections 
among the non-hospitalized psychoneu- 
Fotics, one in one normal group and none 
in the other. Each of the hospitalized 
groups also gives significantly more de- 
scriptions 
group. The non-hospitalized subjects 
seem to be more willing or able to со- 
operate than the hospitalized ones, since 
they comply more closely with the di- 


rections. All of the clinical groups give . 


significantly more alternate themes than 
one normal group. (The other normal 
group also has less of these than any clini- 
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cal group, but the differenge is significant 
only when comparison is made with the 
hospitalized psychoneurotic group.) The 
excess of alternate themes in these groups 
is no doubt a function of indecision and 
uncertainty of the subjects in them just 
as the greater number of questionable 
outcomes is. Analysis by individuals does 
not change the picture very much, ex- 
cept that now fewer significant differ- 
ences appear, but all are in the same 
direction. 

In light of what has been stated in the 
TAT literature about the formal char- 
acteristics of stories of various clinical 
groups, it is appropriate to see how our 
data corroborate or are at variance with 
what has been reported. The fact that 
both groups of normal subjects give more 
symbolic stories? than each of the clini- 
cal groups, significantly more than the 
psychoneurotics, contradicts reports that 
this is a characteristic of the stories of 
Schizophrenics (44) and psychotic chil- 
dren (32). The normals also give more 
abstract stories, although this difference 
is not significant at the .05 level. It has 
been said that ambitendency in story 
trend is also a schizophrenic character- 


istic (46), yet as noted above this seems 


to be more characteristic of psychoneu- 


rotics. As compared with normals, the 


* An example of a symbolic story contributed 


by a normal subject is as follows: It looks like 
Dante's inferno. Some sort of monstrosity—a pri- 
meval picture—symbolic of evil. This creature 
еге is exploring the unknown—symbolic of man. 
Might even say it's all a picture of life—dark, 
cloudy, devastation. The way it’s going to turn 
Out is that this small creature is going to defeat 
this evil up here. More or less a symbolic ap- 
Proach. (Card 11 P 
| An uei a, an abstract story contributed 
y a normal subject is as follows: It looks lil e 
Salvador Dali, Well, this is an emotional pic 
ture. It shows happiness and contentment of this 
Couple despite storms and adversities. They are 
Secure in each other's love. Happiness is ex- 
Pressed here—the ultimate in marital bliss— 
Peace and contentment. (Card 19) 


schizophrenics do give more such themes, 
but they seem to be representative of all 
clinical groups and not distinctive of 
schizophrenics. Story continuations from 
one picture to the next has been listed 
as diagnostic of schizophrenics (44), but 
again our data show this to be more char- 
acteristic of normal stories (Table 11). 
Statements that the depicted situations 
are unreal are also more frequent among 
normals; and the difference is significant 
between one such group and the hos- 
pitalized general neuropsychiatric group. 
The schizophrenics and the general neu- 
ropsychiatric patients make significantly 
more comments about the picture than 
do both of the psychoneurotic groups 
and one of the normal groups. This too 
has been called a psychoneurotic char- 
acteristic (46). 

Other unusual kinds of interpretation 
are also more common among normals 
and often do not even appear among 
some of the clinical groups, as shown in 
the latter part of Table 11. For example, 
having the central character outside the 
picture is an interpretation given by six 
normal subjects and two schizophrenics; 
and three normals included the exam- 
iner in the story,? whereas only one each 
among the hospitalized psychoneurotics 
and the general hospital group did like- 
wise. This latter has been called an ex- 
clusively psychotic characteristic (32). As 
would be expected, peculiar verbaliza- 
tions, confused stories, and stories with 
no conceivable connection to the picture 
are significantly more characteristic of 
schizophrenics than other subjects, al- 
though the actual number of schizo- 
phrenics who produce stories with such 
characteristics are very few. 

The finding that there is more differ- 


5 One of the characters in cach of these stories 
was a "Dr. Eron." 
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entiation among groups on the basis of 
the formal characteristics of the stories 
than on the basis of content is compa- 
rable to the results of Anastasi and Foley 
(3). However, just as they report, none 
of the items occurs with a sufficiently 


high frequency in any of the groups to 
be regarded as a regular feature of its 
stories. Each of the deviations in the 
check list was observed in only a small 
minority of each population and very few 
of the items were absent from any group. 


IV. NORMATIVE DATA FOR THE ТАТ 


I г Is obvious that the TAT pictures 
themselves call forth certain kinds 
of stories with characteristic emotional 
tone and that they are as important as 
the clinical classification of the particu- 
lar subject in determining what kind 
of a response he will make. The identi- 
fication of the most popular responses to 
еасһ card has been designated a study 
of “picture pull.” It has been felt that 
each individual picture has its own stim- 
ulus properties which evoke themes, 
identifications, feelings, etc. which are 
peculiar to it and which differ from 
those elicited by other pictures. Appendix 
C lists for each picture the most fre- 
quent themes contributed by all sub- 
jects, classified by group. Also appearing 
in Appendix C are the frequency ratings 
of emotional tone and outcome, fre- 
quency of shift, type and frequency of 
all perceptual distortions, unusual atten- 
tion to details, and unusual interpreta- 
tions, Information of this type is indis- 
pensable for anyone who wishes to make 
à valid interpretation of a TAT protocol. 
It also gives a good indication of what 
cards should be used when there is in- 
sufficient time to administer the whole 
set of cards and the definite area of ad- 
justment is known about which informa- 
tion is desired. 

Certain pictures have been said to be 
more stimulating than others (31, 46). 
An analysis of the number of themes 
elicited by each card corroborates this 
claim, Table 14 lists the pictures in the 
order of their stimulatory value for these 
150 subjects. There was only one sig- 
Nificant intergroup difference in the num- 
ber of themes aroused by each picture. 
This was for Card 16, in which Group 
(C2) has significantly more themes than 


Group (GNPH) (chi square 6.1538, sig- 
nificant at .o5 level of confidence). 

Since the test is usually administered 
in two parts with one half of the cards 
presented in each session, it is desirable 
to know if there is any difference in the 
kind of story told for either half. All 
significant differences in themes and lev- 
els of interpretation for the first and 
second 10 pictures appear in Table 15, 
and there are significant differences for 
48 themes and eight levels, both many 
more than could be expected by chance. 
Themes and levels for which there is 
no occurrence in one or the other parts 
of the test are listed in Table 16. It is 
obvious from these two tables that each 
half of the test has a differential effect 
on the kind of stories it elicits. A study 
by Kannenberg (29) indicates that this 
is not a function of order of presenta- 
tion or special instructions to make the 
stories as imaginative as possible, as had 
previously been assumed (16). 


TABLE 14 


RANK ORDER or PICTURES ON BASIS OF NUMBER 
or THEMES WHICH EACH ELICITS 


Stimulatory 


Value Picture No. of Themes 
I 13MF 456 
2 20 421 
3 18BM 413 
4 6ВМ 395 
5 3BM 373 
6 4 361 
7 12M 352 
8 15 340 
9 7BM 316 

10.5 10 3or 

10.5 1;BM 301 

12 8BM 287 

13 oBM 273 

14 14 264 

15 5 255 

16 2 230 

17 Р 237 

18 19 225 

19 II 202 

20 16 199 
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TABLE 15 


© 


DIFFERENCES IN NUMBER OF THEMES IN PICTURES 1-10 AND 11-20 SIGNIFICANT AT OR 


BEYOND .o05 CONFIDENCE LEVEL 


Theme 
1-10 11-20 Chi-square P 
No. Name 
IA:a Pressure from Parents 260 29 183.0450 .ool 
IA:b Succorance from Parents 172 14 132.5215 .OOI 
ІА:с Nurturance to Parents 20 5 7.8400 +005 
TAik Disappointments to Parents 25 š 12.0333 ‚оо: 
IA:f Departure from Parents 73 11 44.2976 .оот 
IArg Concern of Parents 30 3 20.4848 „OOI 
IAn Death or Illness of Parents 79 16 41.7789 .ool 
TArm Filial Obligation 38 3 28.1951 „OOI 
IArp Marriage of Child 34 1 29.2571 NU 
Total Parent Disequilibrium 809 135 479-7976 .оот 
ТА га Pressure from Partner 102 24 47.0555 .ооІ 
IA2b Succorance from Partner 38 7 20.5000 .OOI 
IA2c Nurturance to Partner " 25 8 7-7575 "Өх 
IA2e Aggression towards Partner 8 35 15.7209 .OOI 
IA2f Departure from Partner 48 1 29.0909 .оот 
1А2ї Illicit Sex with Violence 1 16 11.5204 .ооІ 
1А2] Death or Illness of Partner 18 44 10.0806 1005 
IA2n Competition by Partners for Hero 34 4 22.1316 “ool 
IA2s Unrequited Love 5 19 7.0417 от 
IA2r Seduction by Partner . 17 3 8.4500 .005 
Total Sibling Disequilibrium © 29 12 6.2439 +015 
ІАза Pressure from Peer 8 44 23.5577 „оо 
IA3b Succorance from Peer I4 120 82.2761 “ool 
ІАзе Aggression towards Peer 23 66 15.6144 oor 
IAad Aggression from Peer 10 49 24.4746 "dol 
Iasi Death or Illness of Peer 16 41 10.1052 005 
A3j elongingness 2 : s 
IAsk Unappreciated by Peers i: š а КЕЛ 
ІАЗІ Competition from Peers 3 3o 29.2845. "oor 
Total Peer Disequilibrium 154 482 read 4 pe 
Total Interpersonal Disequilibrium 1396 1021 5 uo бот 
IB: Aspiration | 196 48 Fu put 
IB4 Behavior Disorder 56 тї aS 73 pass 
IB6 Moral Struggle 2 19059 k 
1 5 5 12.0333 „OOI 
IB; Guilt—Remorse 
44 99 20.2500 .oo1 
1В8 Drunkenness 26 106 à сәт 
1Вїт Occupational Dissatisfaction 129 28 M 76 o3 SOL 
IB:3 Retribution 23 8 3.6943 E 
1В1б Religion 14 Кы 8.1127 +005 
IB17 Loneliness s Q f 74.6746 sgor 
IB20 Vacillation 34 51 17.2537 400% 
Total Intrapersonal Disequilibrium 972 152 50.6807 4901 
ІС2 Legal Restriction 27 Es 21.0236 490% 
ICs Aggression from Environment E 12.4842 «002 
1C6 Escape from Perilous Environment 40 218 107.2283 “< 
“Total Impersonal Disequilibrium 5 52.0324 soar 
IIA:f Contentment with Parents xu 562 132.8426 oor 
IIA2c Contentment with Partner 43 1959207 ago 
IIA2d Reunion with Partner i 7 35.3152 ducc 
Total Partner Equilibrium тїт Ё 15.0417 .001 
IIA3d Peer Approbation i 29 46.8642 ‚оо1 
Total Peer Equilibrium $ 17 14.2223 .оот 
ПВ: Self Esteem 3 8o 46.8387 .oo1 
ПВ; Reminiscence, happy E ar 33.5814 .оот 
ПВ4 Кейгетепї si 37 9.4902 .005 
ПС Favorable Environment A 36 23.9427 . 0o1 
Total Impersonal Equilibrium 1 34 22.1316 oor 
Total Equilibrium 4 36 24.0250 .оо1 
334 404 6.4512 .OI5 


(Continued on next page) 
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o 
Theme 
1-10 11-20 Chi-square 
No. Name H 
Level 
ША Symbolic 16 
72 ‚58. 
n B Abstract 2 19 Bee 
H D Unreal 8 68 53-4405 
T G Autobiographical 31 78 42.3500 
ин Continuations 3 14 5.8823 
HII Alternate Themes 52 123 66.6475 
III J Comments 22 42 5.6406 
| HI L Rejection I3 33 7.8478 
j . 
TABLE 16 
THEMES AND INTERPRETATION LEVELS FOR WHICH THERE Is No FREQUENCY 
IN ONE HALF OF THE TEST 
Theme 
۲ _ 
No. Name ^E hod 
IA:h Incest 2 
ІАтп Confession to Parent 18 
IAro Bad News to Parent T 18 
ТА: Collusion with Parent 5 
IA4b Succorance from Sibling 1 
IA4e Aggression to Sibling 4 
IA3n Envy of Peer 4 
IA3m Hypnotism бт 
Nc IB:8 Compensation m 
| 1В23 Inconsequential Sadness 3 4 
IB24 Hurt Feelings 18 
1825 Jealousy (Unspecified) 3 
1В28 Homesickness 4 
IIA:b Resignation of Parent I 
IIAre Fulfillment to Parent 4 
ПА га Admiration of Partner 3 
ПАзс Reunion with Friends ^ 
IIA3e Exhibition 36 
IIB6 Resignation to Lot 3 
IIB7 Ordinary Activity uz 
IIC2 Rescue s 
B 
y Level of Interpretation 
7 
i ШЕ Fairy Tale m 


ШЕ Central Character Out 8 


V. SUMMARY AND CONCLUSÍONS 


A. Approximately 3,000 Thematic Ap- 
perception "Test (TAT) stories contrib- 
uted by 150 male veterans including 
50 college students, 25 non-hospitalized 
psychoneurotics, 25 hospitalized psycho- 
neurotics, 25 hospitalized schizophrenics 
and 25 miscellaneous neuropsychiatric 
patients, were analyzed for themes, level 
of interpretation, emotional tone, out- 
come, identification of characters, per- 
ceptual distortions and attention to de- 
tails. The data were arranged (a) accord- 
ing to total number of responses for all 
20 cards presented, and also (b) according 
to number of individual subjects con- 
tributing a certain number or more of 
à given response. Significant differences 
between groups were calculated on bóth 
of these bases by the chi square method, 
and all differences significant at or be- 
yond the .o5 level of confidence were 
noted. In all, 400 significant inter-group 
differences were noted, as shown in Table 
17. Аз seen in this table, the two most 
similar groups are (Ci) and (C2), the 
two normal ones, since there were only 
6 significant differences found between 
them, which is much less than would 
normally be expected by chance. Next 
in order of similarity are the normal 
groups (C1) and the non-hospitalized 
psychoneurotics (NHPN), and groups 
(C2) and (NHPN), with 10 and 11 sig- 
nificant differences respectively. Thus the 
three non-hospitalized groups are the 
most similar. Next in order of similarity 
are (HPN) and (HS), the hospitalized 
psychoneurotics and schizophrenics; (HS) 
and (GNPH), the hospitalized schizo- 
phrenics and the general neuropsychiat- 
ric hospitalized group; and (HPN) and 
(GNPH) each with 12, 16, and 17 sig- 
nificant differences respectively, The 
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groups with the number of significant 
differences between them closest to this 
are (NHPN) and (HS), with 25 such dif- 
ferences; all other "cross-comparisons" 
(i.e, comparisons between hospitalized 
and non-hospitalized groups) yield а 
larger number of significant differences— 
up to 46 for group (C1) vs. (GNPH). It 
would seem, then, that our six groups of 
subjects arrange themselves in two clus- 
ters according to whether or not they ате 
hospitalized, since the three non-hospital- 
ized groups are most similar, followed 
by the three hospital groups, and finally 
by comparisons between groups of the 
two diflerent classes. 

B. When each of the six groups was 
compared with every other group ОП 
each of the six variables specified below, 
not many more significant differences 
emerged than would be expected by 
chance. The findings can be summarized 
briefly as follows: 


1. Emotional tone. The most frequent stories 
of our entire group are moderately sad re- 
gardless of the clinical classification of the 
subjects contributing them. The hospitalized 
subjects more frequently tell stories which are 
neutral in emotional tone than do the non- 
hospitalized subjects. This appears to be more 
a function of hospitalization than of serious- 
ness of illness, because there is no steady pro- 
gression in the number of such stories from 
normals through psychoneurotics to psychotics. 

2. Outcome. The conclusions offered by the 
subjects seem to be more a projection of their 
own creativity than is the emotional tone, 
since there is considerably more inter-indi- 
vidual and inter-group variability for this 
dimension. However, the most frequent out- 
come is happy for all except the schizophrenic 
group, which contributes more neutral out 
comes. As in emotional tone the hospitalized 
Subjects give less extreme outcomes, either very 
happy or very sad, than do the non-hospital- 
ized subjects; and again the hospitalization 15 
considered the chief factor in accounting for 
this greater blandness. 

3. Shift. Slightly less than half of the sub- 
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o TABLE 17 
NUMBER OF SIGNIFICANT DIFFERENCES BETWEEN GROUPS 


T Rank 
Groups Themes Levels Tone ои ç Shit , DE a Total Grand Simila 
A. Total responses 
1-С2 2 1 1 Я 
Ci-NHPN I 2 2 z 6 
Cr-HPN 16 5 3 5 29 
C1-HS 12 4 3 6 5 3o 
C1-GNPH 14 E 5 6 3 š 34 
| C2-NHPN 4 2 3 3 P. 
$ C2-HPN 10 7 2 8: 22 
2 C2-HS 9 4 3 3 2 d 
j C2-GNPH 21 7 3 4 I š 37 
NHPN-HPN 8 3 3 5 1 26 
NHPN-HS 4 6 2 2 4 x 
NHPN-GNPH 10 4 3 5 E 1 20 
HPN-HS 5 2 2 x 19 
HPN-GNPH ц 1 : 1f 
HS-GNPH 1 1 3 bass 
Total 297 
B. Individuals 
1-C2 o 
Ci-NHPN 1 5 
Ci-HPN 3 2 1 a p 
C1-HS 3 1 x 5 
Ci-GNPH 2 а 4 2 9 
Ca-NHPN r : 
" C2-HPN 1 2 a š 
C2-HS 1 I 1 3 
C2-GNPH 2 ш 3 
NHPN-HPN 1 t 2 
NHPN-HS I т 
NHPN-GNPH 1 3 ^ с 8 
HPN-HS 
HPN-GNPH 3 ^ 
HS-GNPH 
Total 25 
C. Card by card Details 
Я Cio 1 I 2 6 1 
m Ci-NHPN ' : š 3 2 : 
Z Cr-HPN 4 3 Ü 4 ue 
Ci-HS 4 š 2 CENE ие 
Ci-GNPH 1 j | AE R 
C2-NHPN 2 
C2-HPN 3 т а i 7 бо: “у 
Cz HS k 2 1 5 29 9 
Co. GNPH 3 I 4 44 13.5 
NHPN-HPN 1 2 ° ° 3. à 
NHPN-HS 1 1 z s š A 1 
NHPN.GNPH š I 3 I £ 8 40 TIS 
HPN-HS 5 2 12 4 
HPN-GNPH xó ~ 
HS-GNPH а i 2 16 5 
Total Ë 
Grand total — = 
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jects terminate their stories with conclusions 
which are happier than the general tone of 
the body of the story. A shift to happier con- 
clusions is less characteristic of our schizo- 
phrenic subjects than it is of the normal ones. 

4. Perceptual Distortions. Misidentification 
of the sex of characters in the pictures, and 
other perceptual distortions, are frequent, and 
no more characteristic of any of our specific 
nosological groups than of our normal sub- 
jects. It was demonstrated that to use mis- 
identification of the sex of the figure in Card 
3 BM as an index of a high feminine com- 
ponent in the male personality is fallacious. 

5. Themes. When the individual groups are 
compared on the basis of our 125 theme check 
list only a few more differences appear than 
could be expected by chance. Themes of 
aggression, hostility, frustration, restriction, 
death, violence, and guilt are common to all 
groups. For no given theme does any one 
group of subjects differ from all others, and no 
group differs from both normal groups in 
more than five themes, The subjects were re- 
arranged into larger groups, and comparisons 
were made between all normal and all pSy- 
choneurotic subjects, and between all hospital- 
ized and all non-hospitalized subjects. On this 
basis more significant differences emerged 
than could be explained by chance. The 
nature of these differences, as well as of those 
among the individual groups, was discussed 
both in the light of the present literature on 
the TAT and possible personality dynamics. 

6. Level of Interpretation. In an attempt to 
see if more inter-group differentiations could 
һе made on the basis of a formal, rather than 
a content, analysis, the stories were classified 
according to deviations from the instructions 
to make up narratives; and many more signifi- 
cant differences were obtained than could be 
expected by chance. Again our results were 
compared with those reported in the litera- 
ture; and many of the differences which have 
been reported, when put to the statistical test, 


were found not only to be non-significant, but 
often reversed. 


С. We interpret these findings to mean 
the following: 


(1) Fantasy productions of various 

clinical groups do not differ sig- 

` nificantly from each other or 

from the productions of a nor- 
mal population. 

(2) Fantasy productions are affected 


by the external “objective” en- 
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vironinent of the individual, in 
this case a limiting, routinized, 
hospital environment. 

(3) The TAT, as a fantasy tech- 
nique, should not be used as а 
diagnostic instrument їп the 
sense of yielding "signs" or “рас 
terns” characteristic of specific 
nosological entities. 

D. In making a valid interpretation 
of a given protocol, due consideration 
must be given to the stimulus properties 
of the cards themselves, which appear to 
be as important a factor in determining 
an individual's response as the actual 
clinical group in which he may have been 
classified. As an aid to the proper utili- 
zation of this technique, a table has been 
prepared and is appended (Appendix C) 
which gives for each of the 20 cards in 
the adult, male series of the Third Edi- 
tion of the TAT, the most popular re- 
sponses contributed by 150 subjects in 
terms of emotional tone, outcome, pet- 
ceptual distortions, identification, atten- 
tion to details, and thematic content. 

E. Although the findings negate much 
of what has been reported in the ТАТ 
literature, they are not to be construed 
аз an argument against the validity of 
the instrument. The TAT was intro" 
duced to investigate the fantasy of nor 
mal individuals, It was assumed that 
from this fantasy there could be inferred 
Important strivings and needs in the 
narrator. Nothing here reported contra 
dicts this assumption, What is demon 
strated is the lack of justification for 
using the method as a diagnostic device 
for the differentiation of nosological 
groups. Schizophrenics, psychoneurotics 
and normal persons all have the same 
needs and strivings, preoccupations and 
attitudes, delusions and misconceptions: 
There may be a difference in emphasis 
and degree from individual to individ- 
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ual, but not from group t» group. Nor- 
mal persons check up on their ideas, and 
correct their delusional thinking through 
techniques of social participation, role 
taking, and the sharing of perspective, 
more readily than individuals with be- 
havior disorders, in whom these skills 
are not as well practiced. But the fact 
remains, according to the evidence of the 
present study, that we all fantasy much 
the same things. Therefore, a technique 
such as the TAT cannot be used as 
a diagnostic instrument in the sense of 
pigeon-holing people into different 


Kraepelinian categories. If we mean by 
diagnosis the determination of the dy- 
namics and content of an individual per- 
sonality and the understanding of his 
preoccupations and conceptions, the 
TAT may well be of great value. It is a 
diagnostic instrument in the sense that 
it gives an understanding of the indi- 
vidual in his own life-setting. The re- 
sults of this study indicate that it does 
not yield differential patterns for classi- 
fication into Kraepelinian nosological en- 
tities. 


APPENDIX А - 


RATING SCALES FOR EMOTIONAL TONE OF ТАТ STORIES 


Key for all scales 
—2 Very sad 
—1 Sad 

o Neutral 
+1 Happy 
+2 Very happy 


General Rating Scale for Emotional Tone of 

Stories: 

—2 Complete failure, submission to fate, death, 
murder, suicide, illicit sex with violence, re- 
venge, aggressive hostility, severe guilt, com- 
plete hopelessness. 

—1 Conflict with attempt at adjustment, rebel- 
lion, fear, worry, departure, regret, illness, 
physical exhaustion, resignation toward 
death, loneliness. 

9 Description, lack of affect, balance of posi- 
tive and negative feelings, routine activities, 
impersonal reflection. 

+1 Aspiration; desire for success and doubt 
about outcome, compensation for limited 
endowment. Description with cheerful feel- 


ing, reunion with friends, contentment with 
world, feeling of security, 


+2 Justifiably high aspiration. Complete satis- 


faction and happiness. Reunion with loved 
ones. 


? Can't make up a story. 


General Rating Scale for Outcomes: 


—2 Complete failure, submission to fate, death, 
murder, suicide, extreme punishment, ex- 
{теше remorse. 

—1 Some frustration: incomplete success in at- 
taining goal, goal attained at expense of 
happiness, disappointment to friends and 


family, acceptance of unsatisfactory situation 
or submission to authority. 


Continuation of ordinary situation, 
of happy and unhappy situations, 
+1 Moderate success, reunion with friends, re- 
covery from temporary disability or depres- 
sion, happiness in success of others. 
+2 Great success, discovery, and/or happiness. 
Extreme contentment, marital bliss, un- 
usual good fortune, reunion with loved ones. 
? Cannot give an outcome, even when explic- 
itly asked for, conditional (if) outcomes, 


alternative outcomes of different emotional 
value. 


balance 


Rating Scales for Emotional Tone of Individual 
Cards: 


(1) 
—2 Complete frustration and hopelessness with 
no resistance. 
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—1 


0 


1 


L 


Dejected, inadequacy with attempt to adjust- 
Frustration with no depression. spirations 
balanced by conflict. Lack of fe ling tonc. 
High aspirations with cooperation but some 
hindrance. 

High aspiration. with approbation and no 
conflict. 


(2) 
Conflict between ambition and duty to fam- 
ily with some feelings of guilt or apprehen- 
sion, disappointment with lot, jealousy. 
Description of picture, planning for future 
with no apparent affect. 
Future planning, description with cheerful 
feeling. 
Complete satisfaction with 
and life's accomplishments. 


(3BM) 
Uncontrolled emotionality, murder, men- 
tally ill, complete frustration, death of loved 
one, suicide. 
Self-pity, aggressive parental pressure, trans- 
itory depression, adolescent confusion, phys- 
ical incapacity. 

(4) 

Desire for revenge, murder, aggressive hos- 
tility. 
Disillusionment, occupational failure, COn- 


flict over extva-marital relations, jealousy, 
pressure from mate, 


(5) 
Overdominant parent or mate, extreme 
parental or partner pressure, aggressive hos- 
tility, murder. 
Parental or partner pressure without aggres 
sion, loneliness, slight frustration. n 
Checking on room or occupants, doing 
household duties, 


Unexpected gift, welcome guest, good news. 


(6BM) 
bad news, severe guilt, conflict over 
Social acceptance of sexual role. 
Parental pressure, filial 
over desires and 
parental home, 


Death, 


x " " ict 
obligation, (o 
duties, departure fror 


(7BM) R 
Disappointment of parents in child, guilt. 
repeated failure, 

Disagreement, recalcitrance, rebellion against 
Parental authority, feelings of inadequacy: 


Parental advice, impersonal discussion. 
counselling, 


> 4 


# 


t 
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Aspiration with encouragement and/or ad- 
vice. 


(8BM) 


2 Murder, death, extreme guilt. 


Worry and concern about accident and 
operation, frustration of ambitions. 
Descriptive, perfunctory, lack of emotional 
involvement, impersonal reflection. 
Aspiration, hope and planning for futurc. 
Adventurous daydreaming. 


(9BM) 
Economic misfortune, physical exhaustion, 
danger of combat, social disapproval. | 
Men at rest, pure description, no emotional 
involvement. 
Comradely feeling, contentment, carefree, 
happy-go-lucky, lack of concern for conven- 
tion. 

(10) 
‘Death, extreme sorrow, tragedy. 
Departure, leaving loved ones, personal 
failure, being comforted for minor misfor- 
tune. 
Lack of affect, balance of conflict. Р 
Reunion, happiness, acceptance, feelings of 
pleasure. А 
Marital bliss, extreme contentment, satis- 
faction and good adjustment. 


(11) 
Life is futile, horrible, complete absence of 
hope, no avoidance of fate. Death, destruc- 
tion, war. 
Struggle against aggressive forces, 
fighting, story detached. from rcality. 
No emotional involvement, little interper- 
sonal action, description. 
Vacation, pleasure trip, happy people. 


(12M) 
Death, suicide, malpractice 
with aggression, rape, curse. 
Reconciliation. to death; illness, parental 
pressure. А 
Hypnosis with no harm involved (experi- 
mentation with or demonstration of hyp- 
nosis). Being awakened from sleep. 
Reunion. 


animals 


(hypnotism) 


(13MF) 
Illicit sex with violence, rape, death, murder 


for infidelity. " 
Disillusionment with sexual experience, re- 


gret for illicit scx. Illness of wife. 


(14) 

—1 Resignation to death of relative. Reflection 
on worldly conflicts, with or without appeal 
to religion, loneliness. 

o Daydreaming without emotional involve- 
ment, any other theme with no emotional 
involvement, adolescent reverie. 

-p1 Contentment with environment, apprecia- 
tion of world around. 

--2 Happy, well-adjusted hero, vacation, plan- 
ning for happiness. 


(15) 

—2 Death of close relative, loneliness for de- 
ceased, mourning, hopelessness, hero re- 
jected by society, suicide. 

—1 Impersonal speculation on death, return of 
dead to cemetery, visiting grave of friend. 

o Description of painting or picture, no affect. 


(17BM) 

—1 Vindictiveness, revenge, trying 10 escape 
from unfavorable environment, fear, inade- 
quacy. is 

o Vacillation in plot (balance of happy and 
? sad themes), doing routine job of acrobatics. 

-+-1 Compensation for limited endowment, desire 
for success with uncertainty about outcome. 

+2 Hero happy and successful. Display of phys- 
ical prowess, adulation of crowd, winning of 
contest, 


(18ВМ) 

—2 Suicide, manslaughter, 

hallucinations, delusions. 

—1 Environmental frustration, accident, ordinary 

drunkenness, personal sorrow not of serious 

proportion. 

o Description of poster or painting. No emo- 

tional involvement. 


(19) 

— Death due to forces of nature or war. 

—1 Fear (child’s fear of supernatural), bad storm 
with little or no emphasis on comfort of 
home. 

o Description of picture. 

+1 Comfort of home during storm, feeling of 

security, 


thwarted escape, 


(20) 

—2 Death of loved one, suicide. 

—1 ‘Disappointment in love, worry, feeling of 
rejection, economic pressure, disillusionment, 
loneliness. 

о Out for a walk, description, no feeling tone. 
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APPENDIX B n 


CHECK LIST OF THEMES AND INTERPRETATION LEVELS 


I. Disequilibrium (tension) 
А. Interpersonal 


1. 


ю 


Parent (includes parent substitutes such 

as older authority figures, etc.) 

a. Pressure—parent or parent figures are 
prohibitive, compelling, censuring, 
punishing, disapproving, interfering, 
checking up, disagreeing with, quarrel- 
ing with, restraining or unduly influ- 
encing child. 

b. Succorance—child seeks or receives aid, 
help, advice, consolation, from parent. 

c. Nurturance—child bestows or offers aid, 
advice, consolation to the parent. 

d. Aggression from—physical harm inflic- 
ted or intended upon child by parent. 

e. Aggression to—physical harm inflicted 
or intended upon parent by child. 

f. Departure—child is taking leave of 
parental home, is separated from par- 
ents. 

8. Concern—parent is worried over phjsi- 
cal or mental well-being of child. 

h. Incest—actual or contemplated. 

i. Death or illness of parent. 

1. Death or illness of child. 

k. Disappointment to—parent is disap- 
pointed їп child's behavior or ac- 
complishments. 

l. Disappointment 
pointed by parent. 

m. Filial obligation—child feels it is his 
duty to remain with, comply with, or 
support parents. 

n. Confession—child tells parent of some 
misdemeanor or crime he has com- 
mitted. 

о. Bad news—child brings bad tidings to 
parent (e.g, death of other parent, 
brother, etc) or friend brings tidings 
of death of son, 

p. Marriage—child tells parent of past or 
impending marriage, parent objects to 
marriage. 

q. Collusion—parent and child are plan- 
ning or executing together some anti- 
social act. 

т. Parental conflict-child is concerned 
over marital problems of parents. 

5. Lost—child has been abandoned by 
parents or is lost. 


in—child is disap- 


. Partner (wife, sweetheart, Opposite-sexed 


peer). 

a. Pressure—partner is prohibitive, com- 
pelling, censuring, punishing, quarrel- 
ing with, etc. 


* Themes in 
in original sam 
included for completeness, 


b. Succorance—hero receives or seeks aid, 
comfort, consolation, assistance from 
partner. 

c. Nurturance—hero bestows or offers to 
partner, aid, consolation, advice, etc. 
d. (Aggression from—physical harm in- 
flicted upon or intended for hero by 

partner.)* . 

e. Aggression to—physical harm inflicted 
upon or intended for partner. 

f. Departure from—hero is leaving part- 
ner either temporarily or permanently 
or is separated from partner. | 

g. Concern—hero is worried over physical 
or mental well-being of partner. 

h. Illicit sex—extra- or pre-marital hetero- 
sexual relationship, non-incestuous, in- 
cludes “petting.” 

i. Illicit sex with violence—rape. 

j. Death or illness of partner. ХР 

k. Disappointment to-partner is disap- 
pointed in hero's behavior or accom- 
plishments. 

l. (Disappointment by—hero is disap- 
pointed by partner), А 

m. Jealousy-hero is jealous of partner's 
attention to others, 

n. Competition—hero is object of compe- 
tition between two or more admirers. 

о. Cuckold—hero discovers partner has 
been having extra-marital relations or 
finds her raped. 

P. Decision—hero must choose between 
marriage and not or between two part- 
ners, 

q: Pursuit-hero is wooing or trying to 
Bet partner to submit. 

т. Seduction—hero is being talked into rc- 
lationship or is being actively pursued. 

5. Unrequited—hero's love is unreturned, 
impotence. 

t. Restraint-hero and partner can't be 
married or have intercourse (because of 
economics, deformities, menstrual pe 
riods, etc.) 

u. Childbirth 


‚ Peer (like-sexed, approximately same age); 


a. Pressure—like-sexed peers are prohibi- 
tive, compelling, censuring, punishing, 
disapproving, interfering, restraining 
etc. 


b. Succorance—hero secks or receives aid, 


parentheses have not been found 
Ple of 150 protocols but are 


А NORMATIVE STUDY OF THE ТАТ 35 


advice, consolation, fron peer. 

C. (Nurturance—hero bestows or offers aid, 
advice, consolation upon peer). 

d. Aggression from-physical harm in- 
flicted upon or intended for hero by 
peer. 

€. Aggression to—physical harm inflicted 
upon or intended for peer by hero. 

f. Desertion—hero has been deserted by 
his companions or is lost Irom them, or 
is departing from them. 

g- Concern—hero is worried about physi- 
cal or mental well being of his friends. 

h. Homosexuality 

i. Death or illness of friend. 

j. Belongingness—desire expressed to be 
with or accepted by peers. 

К. Unappreciated by peers. | 

1. Competition—hero is competing with 
peer in game or contest. 

m. Hypnotism 

n. Envy—hero is envious of peer’s accom- 
plishments, capacities, etc. 

o. Revenge—hero is anxious to exact re- 
venge from a peer for some past deed 
or is taking the revenge. 

p. Disregard for—hero is unconcerned over 
welfare of his peers. 

q. Bad in[luence-hero's friends аге con- 
sidered unfavorable associates by him, 
his family, or narrator. 


4. Sibling 


a. Pressure—sibling is prohibitive, com- 
pelling, censuring, punishing, disap- 
proving, interfering, restraining, etc. 

b. Succorance—hero seeks or receives aid, 
advice, consolation from sibling. —— 

c. (Nurturance—hero bestows or offers aid, 
advice, consolation upon sibling). x 

d. (Aggression from—physical harm in- 
flicted upon or intended for hero by 
sibling). 

е. коч to—physical harm inflicted 
upon or intended for sibling by hero. 

f. Rivalry—hero and sibling are competi- 
tors for same objective. 1 

E. (Concern—hero is worried about physi- 
cal or mental well-being of his sibling). 

h. Incest—with sibling. 

i. Death or illness of sibling. 


В. Intrapersonal 


1. Aspiration—dreaming of future, hoping 
for future, determination. s 

8; Inadequacy—realization, whether. Дын 
fied or not, of lack of success; individua 
is at a loss to cope with situation. 

3. Curiosity wondering about G 
tion of object, contents of room, eic; 
desire to observe, inquire, explore, in 
Vestigate; to acquire facts. 


4. 


Tis 


Behavior disorder—personal -maladjust- 
ment of all sorts, neurosis, psychosis, 
sleeplessness, hallucination, withdrawal, 
eccentricity, morbid preoccupation. 
Suicide—attempted or completed, pre- 
occupation with. 

Moral struggle—concern over what is 
right and wrong, hesitancy in indulging 
in some act because of ethical prescrip- 
tions. 


- Guilt—remorse. 
. Drunkenness 


Fear—hero is apprehensive, alarmed, ter- 
rified, of some person, thing, force. 
Rumination—inability to understand 
world and its problems, adolescent rev- 
erie, man's insignificance, "what am I?" 
"what's it all about?" 

Occupational concern-deciding between 
jobs, considering vocations, dissatisfied 
with present employment, fails at pres- 
ent employment. 


. Physical illness or death of central char- 


acter—(other than when in a peer, part- 
ner, sibling, parent relationship, which 
are included in those specific categories) 
other than suicide. 

Retribution—forced to atone or be 
punished for some antisocial act. 


. Reminiscence, sad-individual is unhappy 


in his memories of the past or contem- 
plation of future. 
Intra-aggression—does physical harm to 
self, short of suicide (which is carried 
under that heading), either thru acci- 
dent or deliberate. 

Religion—prayer, seeking consolation 
from God, religious conflict, religious 
awakening, 

Loneliness—central character misses 
someone, is an outcast, friendless, home- 
less. 

Compensation—when individual has one 
characteristic or stroke of fortune to 
make up for another bad characteristic 
or misfortune. 

Vacillation-wasting time, putting off a 
distasteful task, procrastination, loitering. 
Acquisition—desire expressed by central 
character to acquire material things, or 
is working greedily for possession of 
goods; miserliness. 

Exhaustion — characters 
“pooped” from overexertion. 
Revenge—hero is preoccupied with wish 
to retaliate for some past wrong from 
unspecified individual (when peer, car- 
ried under that heading). 


completely 


- Sad over death of dog, broken toy, loss 


of object, etc. 
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24. “Hurt feelings"-central character is very 
sensitive over some happening, slight, in- 
sult, etc. 

25. Jealousy-of unspecified individual (if 
specific person named, carried under 
appropriate heading). 

26. Self-pity—hero feels sorry for self. 

27. Resurrection-return of the dead, super- 
natural. 

28. Homesickness—expressed by central char- 
acter. 

29. Grief-over loss of some unspecified indi- 
vidual. 


C. Impersonal 


1. Economic pressure—individual is com- 
pelled to, or prohibited from, or limited 
in, doing something because of the lack 
of money. 

. Legal restriction—individual is incarcer- 

ated, arrested or detained against his will. 

Generalized restriction—environment is 

generally frustrating because of backward- 

ness, danger, lack of opportunity, etc.; 
hopelessness of life. 

4. Aggression towards environment—robbery, 


ы 
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b. Congerality—sociability, friendliness, 
satisfactory affiliation. 
c. Reunion—with friends. z 
d. Approbation—hero is lauded, appreci- 
ated, by friends and peers. 
е. Exhibition—hero is amusing, entertain- 
ing, attracting other: 
f. Ordinary activity—everyday association 
with nothing particular happening. 
4. Sibling (no II themes found in original 
sample of 150 cases). 


B. Intrapersonal 


1. Self esteem, confidence, belief in own 
superiority, 

ranquility-pcace of mind, content with 
environment and own accomplishments, 
enjoyment, aesthetic appreciation. 

3. Reminiscence, happy—individual is con- 
tent with his memories. 

4. Retirement—central character is asleep: 
resting, no physical exhaustion, 

5. Occupational satisfaction 

б. Resignation lo lot—individual has re- 
signed himself to whatever situation. 


= 


1 7. Ordinary  activity-individual is going | 
accident, murder of unspecified indi- about his everyday activities. 
vidual (if individual named, carried C. Impersonal 


under the proper heading). 
5. Aggression from—impersonal source, acci- 
dent, animal, nature, disease, etc, 
. War 
. Escape from perilous environment—indi- 
vidual is in the act of getting out (doesn't 
include rescue, which comes under II). 


ы а 


1. Favorable environment-individual is be- 
ing helped by favorable circumstances, is 
enjoying his surroundings. 

2. Rescue—successful escape from perilous 
environment or cessation of noxious stim- 
uli (already accomplished). 


HI. Level of interpretation 


п. Equilibrium (adjustment) 


A. Symbolic—depiciting an idea or moral. 
A, Interpersonal В. 


1. Parent 

a. Cooperation—parent is working with 
child towards his own goal. 

b. Resignation-parent is resigned to child's 
activity. 

c. Idealization—child idealizes parent. 

d. Reunion-of child with parents. 

e. Fulfillment—child lives up to expecta- 


Abstract-depicting a feeling. 


C. Descriptive-no action depicited, no могу, 


D. Unreal— 


E. Fairy tale—legend 
F. Се 


mere d 


ription of picture. 

This is a picture,” “This is some- 
one's dream," etc.—can't accept as à real 
situation, 


‚ impossible happening. 
ntral character is not in picture. 


G. Autobiographical—where narrator is char- 


tions of parents. i 
e acter in th “lik ed to 
(A Contentment—blissful home situation. me,” doces UR pa whee happen 
g. Ordinary familial activily—just talking H. C sii. renes touselt. " 
spending an evening кз Bs . Continuations-refers to past stories. 
m А s 1. Alternate themes given for same picture. 
а. Admiration J- Comments about artistic merits of picture: 
b. Cooperation— recognition of artist or painting. 

b n—between mates, so that K. Denial of a th DELE 5 еп it 
both are working for same end and ar i = tache 
happy. е i К commonly given. 

& Conte "р h . Rejection 
=a ntment—mutual love and affec- M. Peculiar verbalizations 
d. Reunion N. Confused, no single discernible plot, im- 
е. Отайпату activity possible conclusion, etc. 
3. Peer а Includes examiner in story. 
› : ic- 
à. Cooperation—working together for same { co: Iy no connection of story to pic 
oal. А 
Б О. Humorous 


e APPENDIX C 


NORMATIVE DATA FOR THE TAT" 


lhis appendix includes a card by card tabula- 
tion of ТАТ responses analyzed in terms of the 
following variables: 

A. Emotional tone of stories 

B. Outcome of stories 

C. Shift in emotional tone from body of story 
to outcome 

D. Most frequent themes (included are all 

themes contributed by at least 5 subjects) 
E. Identifications of characters and objects (for 

cards on which such identification is not 
uniform, and five or more subjects produce 
the divergent identification) 

F. Auention to details (for cards on which five 
or more subjects single out some unusual 
or minute aspect of the picture for clabor- 
ation in their stories) . 

The population on whom this analysis is 
based includes 50 normal male college students, 
25 non-hospitalized male ps choneurotics, 25 
hospitalized male psychoneurotics, 25 hospital- 
ized male schizophrenics and a group of 25 
males representing a general neuropsychiatric 
hospital population. All groups аге similar as to 
age, education, intelligence, and ma al status 
and all subjects аге veterans of World War П. 
For the purposes of presentation in this appen- 
dix these 150 subjects have been divided into 
two groups of 75 cach. Group Н consists of the 
hospitalized subjects and Group NH of the non- 
hospitalized ones. This division of subjects into 
two groups is justifiable since the more detailed 
anal in the text reveals that hosptalization is 
the chief significant factor differentiating these 
subjects. The numbers appearing in the body of 
the Appendix represent percentages based on a 
total N of 75 in cach group. A 

Readers desiring these data broken down into 
the individual groups, with statistically signifi- 
cant differences noted where they occur, may 
Obtain the same from the author at the Depart- 
ment of Psychology, Yale University, 333 Cedar 
St, New Haven, Conn. 


Card 1 
A. Emotional Tone Ratings 
—2 =! о +1 +2 
H 0.0 29.3 53:3 13-3 4.0 
NH 1.3 34-7 30-7 24.0 9.0 
Moa frequent story is neutral to moderately 
sad. 


3rd Edition, Har- 


1 
Included are all 20 cards ( ett 


vaste University Press) recommended for 
adult males (41) 
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The hospitalized subjects contribute signifi- 
cantly more neutral stories. 


B. Outcome Ratings 

= —1 o +1 +2 ? 

1.3 8.0 37-3 34-7 8.0 8.0 

NH oo 17.8 30.7 25. 17.8 8.0 
Most frequent outcome is happy. 


C. Shift 
Happier Sadder No Shift 
H 38.7 53 56.0 
NH 373 9:3 53-3 


About half of the subjects shift, 
Significantly more subjects shift to happier than 
sad conclusions. 


D. Most Frequent "Themes? 


H NH 
1. 181° Aspiration 28.0 46.7 
2. ІАла Parental pressure 38:5 41.3 
3-,JA3j Belongingness 20.0 25.3 
4 IB Curiosity 219 10.7 
5. 1B»  Inadequacy 21.33 9.3 
6. 1811 Occupational dissatisfaction 6.7 8.0 
7. IBıg Vacillation 53 58 
8. 1C1 Economic pressure 18 5-3 
HI С Autobiographical 93 40 


“Contributed by five or more subjects and 
listed in decreasing order of frequency. 

" Code number to facilitate looking up defini- 
tion on check list. 


Card 2 
А. Emotional Tone Ratings 
k —1 о +1 +2 
H 0.0 347 44.0 21.3 0.0 
NH 2.7 58-8 28.0 16.0 0.0 


Most frequent story is moderately happy. 
Hospitalized subjects contribute more neutral 
stories (not significant). 


B. Outcome Ratings 
—2 —1 o +1 +2 ? 
H 0.0 8.0 48.0 46.7 0.0 0.0 
NH 2.7 18.7 26.7 45:3 4.0 1.3 
Most frequent outcome is happy. 
Hospitalized subjects contribute significantly 
more neutral outcomes. 


C. Shift 
Happier ^ Sadder No Shift 
H 413 53 53.0 
мн 49-3 10.7 40.0 


About half of the subjects shift. 
Significantly more subjects shift to happier 
than sad conclusions. 
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D. Most Frequent Themes 


H NH 
1.1811 Occupational Concern 28.0 33.3 
2. IB1 Aspiration 26.7 33.3 
3. 1С1 Economic pressure 17-9 10.7 
4. IA1a Parental pressure So 14.7 
5. IA1b Succorance from parent 8.0 145 
6. IIB2 Tranquility 8.0 12.0 
7- IAim Filial obligation 10.7 8.0 
8. IA1f Departure from parent 8.0 8.0 
9. IA2m Competition for partner 53 40 
10. ТАЗ} Belongingness 27 40 
11. IIC3. Generalized restriction оо 67 
12. IIB; Occupational satisfaction 40 2.7 
HI c Descriptive 18.3 5.3 
ША Symbolic 27 10.7 
HI J Comments about artistry, еіс. 5.3 2. 
HI D Unreal 40 27 
E. Identifications 
H NH 
1. Father-Mother-Daughter 40.0 29.9 
2. Husband-Wife-Unrelated girl 105 64 
*3. Son-Mother-Daughter 40 16.0 
`4- Son-Mother-Unrelated girl 13 40 
5. Brother-Sister-Sister 10.7 %3 
6. Man-two women (all unrelated) 6.7 2, 
7. Two characters only specified 93 173 
8. Four characters specified (includes 
figure in extreme background о 53 


EE 
9. Only one character specified (girl) 9.5 14.7 


F. Attention to Details 
1. Pregnancy of women on left 25.3 22.7 
2. Books held by young girl 22.7 24.0 


* Difference is significant at or beyond .o5 
level. 


Card 3 BM 
A. Emotional Tone Ratings 
—2 —1 o +1 +2 
H 52.0 44.0 2.7 1.3 оо 
мн 52.0 46.7 1.8 0.0 0.0 


Preponderance of stories are moderatel 


y to 
very sad. 


B. Outcome Ratings 


us am 9 tr +2 ? 
H 24.0 9.3 18.7 28.0 0.0 


20.0 
NH 25.3 10.7 21.3 26.7 2.7 18.9 
Most frequent outcome is moderately happy. 
C. Shift 
Happier ә Sadder Мо Shift 
46.7 13 2 
NH 52.0 2 E 


About one half of all subj 
pier conclusions. 


о Оле 1 i£ 
z 
nly rare individuals shift to sadder conclu 


J 45-3 
ects shift to hap- 


D. Must Frequent Themes 


H NH 

1. 1B5 Suicide 26.; 16.0 

2. IA1a. Pressure from parents 200 21.3 

3. IB4 Behavior disorder 13.3 20.0 

*4. IB; Guilt-remorse 22.7 80 
*5. IC4 Aggression to environment 18.7 53 
6. IAii ‘Death of parent 93 13-3 

7. 1812 Physical illness or death of 

D central character 1333 93 

8. IC» Legal restriction бо 67 

9. IB2ı Exhaustion 13.3 67 

10. IBig. Retribution 93 93 
11.182 Inadequacy 53 133 
12. ІСі Economic pressure 93 93 
13. 1824 Hurt feelings 40 80 
14. IAga Pressure from partner 67 40 
15. IA2s Unrequited love 40 490 
16. IA2j Death or illness of partner 40 40 
17. ТАзе Aggression to partner 25 40 
18. IBio. Rumination 1g 67 
19. IC5 Aggression from environment 2.7 4:0 
20. IB7 Loneliness 53 > 
ШІ Alternate Themes 93 10:7 
III G Autobiographical 93 13 
HI K Denial of a theme 58 107 


* Difference is significant at or beyond .05 
level. 


E. Identification 
1. Confusion of sex 


a. Misidentification immediate 25.3 307 
b. Uncertain at first and then mis- 

identify 147 18.7 
с. Uncertain at first and then iden- 

tify correctly 40 10-7 
d. Cannot decide whether male or 

female &o 1⁄4 


More than half of all subjects are confused 
over the sex of the central character. 
2. Identification of gun 
a. Questions whether it is a gun 147 92 
b. Identifies as something else, e.g., 


scissors, comb, knife, etc. 5.3 193 
€. Questions whether something else 27 53 
d. Can't decide what it is 4o 53 
F. Attention to Details 
1. Concern with object 400 42-7 
2. Comments about figure, eg. 
hunchback, deformed, etc, 93 173 
3. Specify locale as prison 13.3 40 
4- Specify locale as mental hospital 4 53 
Card. 4 
A. Emotional Tone Ratings 
—2 —ї o +1 = 
Mit 2.7 84.0 10.7 1.3 5 
49 047 xs 8 0 


Preponderance of stories is moderately sad. 


r 
/ 
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B. Outcome Ratings 
—1 o + +2 ? 


° ? 
H 6.7 10.7 213 400 1.3 18.7 
1.3 


NH 18.7 16.0 53-3 1.3 9.3 
Most frequent outcome is moderately happy. 
C. Shift 
Happier Sadder No Shift 
H 573 53 374 


NE 69.3 13 29.4 
More than half of all subjects shift to happier 
conclusions. 
Only rare individuals shift to sadder con- 
clusions, 


D. Most Frequent Themes 


H NH 
1. IA2a Pressure from partner 440 50 
2. IA2b Succorance from partner 
3. IA2m Competition 
4. IA2f Departure from partner 
5. IA2r Seduction 93 107 
6. IAge Aggression to peer 93 93 
7. 1811 Occupational concern 10.7 80 
8. IA2m Jealousy of partner 167 6.7 
9. TAgh Illicit sex 40 120 
10. 186 Moral. struggle 4.0 12.0 
11 ІСі Economic pressure 8.0 8.0 
1з. IB4 Behavior disorder 8.0 6.7 
13. 187 Guilt-remorse 67 40 
14. 188 Drunkenness 67 40 
15. IA2p Decision between partners 40 4.0 
16. IBı Aspiration 8.0 оо 
17. IB24. Hurt feelings 53 13 
18. IC; Aggression from environment 4.0 2.7 
19. ICG War оо 40 


E, Identification of Characters and Objects 


1. Woman in background seen as real, 
not picture 

- Woman in foreground seen as ne- : 
gress 27 6.7 

3. Woman in foreground seen as nurse оо 9.8 


107 80 


ю 


F. Attention to Details 
- Mention of picture in background 347 42-7 


Card 5 
A. Emotional Tone Ratings 
—2 —ї о +1 +2 
H 1 72.0 8.7 о.о 
8 173 7 
us 8o 480 333 67 40 


Most frequent story is neutral to moderately 


sad. 

The hospitalized subjects contribute signifi- 
cantly more neutral stories and the non-hos- 
pitalized subjects significantly more moderately 
Sad ones, 


B. Outcome Ratings 


—2 —1 о +1 EE ? 
H 1.8 6.7 65-8 17:8 0.0 9.3 
NH 67 133 373 173 53 67 


Most frequent outcome is neutral. 
Significantly more hospitalized subjects con- 
tribute neutral outcomes. 


C. Shift 
Happier Sadder No Shift 
H 22. 2:7 748 
NH 36.0 8.0 56.0 


The majority of subjects do not shift for this 
picture. 
D. Most Frequent Themes 


H NH 

1. IB Curiosity 42.7 41.8 
2. ТАла Pressure from parent 33.9 40.0 
3. IAsh Illicit sex 8.0 20.0 
4. IAig Parental concern 10.7 12.0 
5. IBg Fear 10.7 12.0 
6. ТАза Pressure from partner 10.7 4.0 
7. 184 Behavior disorder ay 8.0 
8. IAgg Concern for partner 40 2.7 
9. 1817 Loneliness 40 297 

Сата 6 ВМ 
А. Emotional Tone Ratings 

—2 —1 о ES 2 

H 18.3 77-3 9.3 0.0 0.0 
NH эз. 7 0.0 0.0 0.0 


Preponderance of stories are sad. 
Neutral stories rare and contributed only by 
hospitalized subjects. 


B. Outcome Ratings 
—2 —1 o +1 +2 ? 

H 2.7 14-7 21.3 37.8 2.7 21.3 
МН 173 17.3 20.0 46.7 2.7 10.7 

Most frequent outcome is moderately happy. 

Significantly more non-hospitalized subjects 
contribute extremely sad outcomes than do hos- 
pitalized subjects. 


C. Shift 
Happier Sadder No Shift 
H 58.7 24 39.6 
NH 58.7 5-3 36.0 


More than half the subjects shift to happier 
conclusions. 


D. Most Frequent Themes 


H NH 

1. IA1a. Pressure from parents 373 46.7 

з. IAif Departure from parents 24.0 36.0 

3. IAıp Marriage of child 16.0 21.3 

4. ТА1Ь Succorance from parents 22:7 133 
5. IC4 Aggression towards environ- 

ment 18.3 21.3 
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6. [Aim Filial obligation 93 187 3. Outcome Ratings 

7. IÀ10 Bad news to parent 10.7 13-3 PIE ë + T 5 
8. 1А11 Death or illness of parents 10.7 10.7 


9. IAin Confession to parent 5.3 16.0 
о. IAik Disappointment to parent 10.7 12.0 
11. IA1j Death or illness of child 6; 93 


12. IA1g Parental concern 105 27 
13. IB11 Occupational concern 53 5:3 
14. IAıc Nurturance to parent 53 40 
15. 187 Guilt—remorse 15 67 
16. 1С5 Aggression from environment 6.7 1.3 
17. 1816 Religion 40 27 
18. ICı Economic pressure 8: 4.0 
19. 166 War 27 40 
ШІ Alternate Themes 67 6.7 
HI G Autobiographical 27 40 
Card 7 BM 

A. Emotional Tone Ratings 

—2 —1 о +1 2 
H 0.0 40.0 50.7 9.3 оо 
NH 53 58.7 32.0 4.0 0.0 


Most frequent story is neutral to moderately 
sad. 


B. Outcome Ratings ° 
—2 —1 о +1 +2 ? 
H 1.3 9:3 34-7 40.0 1.8 13.3 
NH 93 13.3 22.7 45-3 5-3 4.0 
Most frequent outcome is moderately happy. 


C. Shift 
Happier Sadder No Shift 
н 41.3 4.0 54-7 
NH 48.0 8.0 46.0 


About half of the subjects shift, the majority 
to happier conclusions. 


D. Most Frequent Themes 


H NH 
1. IAib Succorance from parent 800 74.7 
з. IA1a Pressure from parents 21.3 28.0 
3. IB11 Occupational concern 21.3 25.9 
4. IB1 Aspiration 93 93 
5. 1C4 Aggression towards environ- 
ment 10 8.0 
6. IB8 Drunkenness 53 53 
7. IB2 Inadequacy 25 бо 
8. ILAif Contentment with parent 6.7 a 
9. [B10 Rumination 27 5 š 
10. IAiq Collusion with parents 2.7 Adi 
11. [Aga Pressure from partner 1.3 Lai 
Card 8 BM 
А. Emotional Tone Ratings 
—2 — ° +1 A 
H uh 58.7 18-3 21. is 
NH 18.8 40.0 7 š 
About half of all stori > TE s 
ries are mo 


derately sad. 


H 1.3 6.7 10.7 547 53 18.7 
NH 8o 9.3 10.7 46.7 10.7 14.7 

About half of all outcomes are moderately 
happy. 


C. Shift 
Happier Sadder No Shift 
H 50.7 4.0 15-8 
NH 34-7 6.7 576 


About half of all subjects shift, almost all 
positively. 


D. Most Frequent "Themes 


н NH 
*i. IBı Aspiration 26.7 533 
2.166 War 24.0 293 


з. IAıi Death or illness of parent 25:3 18.7 
4. IBı2 Death or illness of central 


character 21.3 147 
5. IC; Aggression from environ- 

ment 13-3 86 
6. 1814 Reminiscence, sad 9:3 6.7 
7. IAgi Death or illness cf friend 107 2 
8. IBıı Occupational concern 67 5% 
9. IA4k Death or illness of sibling 80 13 
10. IBio. Rumination oo 93 
11. IC4 Aggression towards environ- 

ment 53 49 
12. [Age Aggression to peer 67 13 
HIN Confused 67 13 


* Difference is significant at or beyond +05 
level. 


E. Identifications of Characters and Objects 


1. Misidentify sex of figure in fore- 


ground xg 19 
2. Misidentify sex of figure on table 5-3 Eu 
3. Background is actual scene 947 aoe 
4. Background and foreground are 
two different events separated in 
time ug 49 
5. Reverses so that figure on table 2 
dreams of figure in foreground 40 53 
6. Figure in foreground "does not 
belong" 67 13 
7. Figure on Table is Father 200 173 
8. Figure on Table is Mother 40 4d 
9. Figure on Table is self igo 187 
10. Figure on Table is friend 2 € 


F. Attention to Details 


1. Mention presence of gun 33-5 
2. Give location of wound 
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Card o BM? 
A. Emotional Tone Ratings 


—2 —1 o +1 2 
H 1.3 26.7 65.8 6.7 0.0 
NH 0.0 26.7 48.0 24.0 1.3 


Most frequent story is neutral, 
Non-hospitalized subjects contribute signifi- 
cantly more moderately happy stories. 


B. Outcome Ratings 


—2 —1 o +1 +2 ? 
H 25; 23 778 93 0.0 6.7 
NH 1g зо 560 20.0 27 8.0 


Most frequent outcome is neutral. 

А š y, 

Of questionable outcomes 1n NH group, 90% 
are contributed by psychoncurotics. 


C. Shift 
Happier Sadder No Shift 
H 20.0 9:3 70.7 
NH 22.7 12.0 65.3 


Less than one third of subjects shift from emo- 
tional tone of story to outcome. 


D. Most Frequent Themes 


H NH 
1, HB4 Retirement 76.0 68.0 
2. IB21 Exhaustion 14:7 147 
з. 1819 Vacillation 93 33 
4. HB2 Tranquility 4.0 16.0 
5. ICi Economic pressure 9:3 9:3 
6. ICG Маг 67 93 
7. ILAgb Congeniality with peers 4.0 8.0 
8. ICg Generalized restriction 53 67 
9. ICs Legal restriction 53 27 
10. 1Agb Succorance from peer до 40 
11. 188 Drunkenness 538 13 


12. IC5 Aggression from environment 93 2-7 
13. IIB Reminiscence, happy 2.7 10 
ШІ Alternate Themes 53 53 


E. Identifications of Characters 


H NH 
1. Laborers 453 5 
2. Tramps 32.0 29-3 
3. Soldiers 80 1 17 
4. Negroes 12.0 7 
F. Attention to Details 
12.0 147 


1. Specify number of men 


Сата 10 
A. Emotional Tone Ratings 
2 —1 o +! +2 
Н 8.0 25.8 4.0 37.9 24.0 
NH 12.0 48.0 1.8 24.0 17-8 
About half the subjects contribute happy 


stories and half the subjects, sad stories. Only 
rare individuals tell neutral stories. 

Significantly more non-hospitalized subjects 
tell moderately sad stories. 


B. Outcome Ratings 
= Et o +1 +2 ? 
H 53 1.8 160 32.0 200 21 
NH 9.3 8.0 8o 413 213 6.7 
Most frequent outcome is moderately happy. 
Significantly more hospitalized subjects con- 
tribute questionable outcomes. 


% 


C. Shift 
Happier Sadder No Shift 
H 28.0 6.7 65.8 
NH 42.7 18.8 0 


About half the subjects shift, significantly 
more to happier conclusions. 


D. Most Frequent Themes 


H NH 
1. ПАәс Contentment with partner 54.7 45-3 
^. IA2c — Nurturance to partner 12.0 20.0 
3. IA2f "Departure from partner 10.7 21.3 
1 Reunion with partner 173 80 
5 War 


6. IAıj Death. or illness of child 
2. МР Departure from parent 
8. HBg Reminiscence, happy 
9. IAgg Concern for partner 


10. IC5 Aggression from environ- 

ment gj 6.7 
ıı. Пла Reunion with parent 53 40 
12. IAgg Pursuit of partner 53 2 
13. 181 Aspiration 2. 4.0 
14. IBı2 Illness or death of central 

character 27 40 


E. Identification of Characters and Objects 


1. ‘Two males 5.8 3 
2. Two females оо 2.7 
3. Young couple 20.0 13.3 
1. Middle aged couple 65; 8.0 
5. Old couple 16.0 20.0 
6. Mother and Son 8.0 12.0 
Card 11 
А. Emotional Tone Ratings 

—2 —1 o +1 +2 
H 10.7 41.3 40.0 1.3 0.0 
NH 10.7 60.0 13.3 10.7 0.0 


Most frequent story is moderately sad. 

Significantly more hospitalized subjects give 
neutral stories and significantly more non-hos- 
pitalized subjects give moderately happy ones. 


42 LEONARD D. ERON 


B. Outcome Ratings 
—2  —1 o +1 +2 ? 
H 10.7 27 887 26.7 27 12.0 
NH 22. 188 25.3 21.3 2.7 9.3 
Most frequent outcome is neutral. 
Significantly more non-hospitalized subjects 
give sad outcomes. 


C. Shift 
Happier Sadder Мо Shift 
H 38.7 9.3 52.0 
NH 32.0 14.7 53-4 


More than half the subjects do not shift. Of 


those that do significantly more shift to happier 
conclusions. 


D. Most Frequent Themes 


H NH 
1. 1С5 Aggression from impersonal 
source 41.33 41.3 
2. IC7 Escape from peril 30.7 24.0 
3. IAge Aggression from peer 18.7 24.0 
4. 183 Curiosity 80 16.0 
5. IAgb Succorance from peer 27 53 
б. IBg Fear 40 до 
7. 1316 Religion 53 27 
8. [Big Illness or death of central 
character 40 эл 
9. 166 War 40 25 
“II C Description 173 40 
HID Unreal 53 93 
HI A Symbolic 67 В.о 
HIE Fairy Tale 1.3 12.0 


* Difference is significant at or beyond .05 
level. 


Card 12M 
A. Emotional Tone Ratings 


—1 о +1 +2 
a 25.9 453 24.0 53 0.0 
NH 26.7 42.7 28.0 2.7 


0.0 


Most frequent story is moderately sad; stories 


only rarely happy. 


B. Outcome Ratings 
—1 о +1 +2 ? 
H 193. 53 255 373 27 ооо 
NH 17.9 9.3 24.0 доо 4.0 53 
Most frequent outcome is happy. 
Of the non-hospitalized subjects, no normal 
subject gives a questionable outcome. 


2 


C. Shift 
Happier Sadder No Shift 
H 48.0 6.7 45-3 
NH 54-7 8.0 38.3 


More than h 


2 alf the subjects shift to happier 
conclusions, 


© C. Shift 
H NH 
1. IAgm Hypnotism 40.0 40.0 
2. 1816 Religion 25.9 17:3 
*3. IBı2 Illness or death of central 
character 347 67 
4- IB4 Behavior disorder 173 16.0 
5. HB5 Retirement 22.7 104 
*6. IAij Death or illness of child 6.7 24.0 
7. IAgb Succorance from peer 173 10.7 
8. IAgd Aggression from peer 13.3 93 
*9. 183 Curiosity n3 133 
10. IC; Aggression from impersonal 
source х 2; 80 
11. [Ara Pressure. from parents 10 67 
12. IA1b. Succorance from parents 13 53 
13. IAga Pressure from peer 53 13 
14. IC4 Aggression towards impersonal 
source 2.7 4.0 
15. IIAge Exhibition 13 53 
16. HB3 Reminiscence, happy oo 6.7 
III I Alternate Themes 120. 6.7 


* Difference is significant at or beyond .05 
level. 


E. Identification of Characters and Objects 


1. Misidentify sex of character on bed 13.3 12.0 
2. Characters are same age (young) 12.0 80 


Card 13MF 
A. Emotional Tone Ratings 


—2 —! ° + +° 
н 58.7 347 6.7 0.0 0.0 
NH 453 49.3 53 0.0 0.0 


Preponderance of stories are sad. Rare indi- 
viduals contribute a neutral story. 


B. Outcome Ratings 


== 


? 
2 —1 о +1 2 

H 187 253 13.3 21.9 оу 200 
NH 267 147 147 293 27 120 


Most frequent outcome is moderately happy: 
but not significantly more frequent than moder- 
ately or extremely sad outcomes, which when 
taken together are significantly more frequent 
than happy outcomes, 


С. Shift 
Happier Sadder No Shift 
H 30.7 27 66.6 
NH 46.7 10. 42.6 
About half the i 


subjects shift, significantly 
more to happier conclusions. 


D. Most Frequent Themes 
H NH 
of partner боо 42:7 
45 547 


1. IA2j. Death or illness 
2. IB7 Guilt-remorse 


T——————————————— е 
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*3. IA2h Illicit sex ® 20.0 42.7 
4. IAge Aggression towards partner 16.0 17.3 
5. 113 Retribution 13.3 16.0 
б. 188 Drunkenness 10.7 133 
7-1C1 Economic pressure u7 93 
8. IA2i Illicit sex with violence 10.7 93 
9. IB2 Inadequacy 133 40 
10. IB; Suicide 10.7 67 
11. IA2u Childbirth 8o 53 
12.184 Behavior disorder 57 6.7 
13. IAgm Jealousy 40 80 
14. IA20 Cuckold 67 40 
15. IAga Pressure from partner 10 53 
16. IB21 Exhaustion 40 40 
17. IAgb Succorance from peer 67 оо 
18. 1816 Religion 40 2 
19. IC; Aggression from impersonal 

source 40 2.7 
HII Alternate Themes 173 133 
II G Autobiographical 67 67 
HI K Denial of a theme BS 2. 


* Difference is significant at or beyond .05 
level. 
E. Identification of Characters 


1. Father-Daughter, or Mother-Son 67 1.3 
. Sister-Brother 40 13 


L 


F. Attention to Details 


1. Mention books on table 12.0 13.3 
2. Mention nudity of woman 12.0 12.0 
Card 14 


A. Emotional Tone Ratings 
—2 —1 o +1 +2 
H 93 187 453 2% 4.0 
NH 2; $33 33:3 200 9.3 
Most frequent story is neutral to moderately 
sad—not significantly greater than moderately 
happy. 
B. Outcome Rating 
—2 „=й o +1 +2 ? 
H 2 5.9 8. 25.3 67 22. 
2:7 58 397 25 
NH 2. 135 267 360 16.0 9:3 
Most frequent outcome is neutral to moderately 
happy. 


C. Shift 
Happier Sadder No Shift 
24.0 6.7 69.3 
NH 353 93 574 


Less than half of the subjects shift. ot those 
that do shift significantly more shift positively. 


D. Most Frequent Themes 


2 
BSS 


1912 ç 
os 


н 
1. 183 Curiosity 25.3 
2. IB; Aspiration 2.7 


3. IIB3 Reminiscence, happy 10:7 12.0 
4. HB8 Ordinary activity 147 40 
5. 184 Behavior disorder 4.0 14.7 
6. IBio. Rumination 5.3 12.0 
7. IICı Favorable environment 6.7 10.7 
8. IIB2 Tranquility 4.0 10.7 
9. 1B17 Loneliness 8.0 5.3 
10. IC Generalized restriction 4.0 7 
11. IB; Suicide 67 з. 
12. 1811 Occupational dissatisfaction 4.0 5.3 
13. 1816 Religion 40 4.0 
14. 1Сб War БЕ 5-8 
15. IC5 Aggression from impersonal 

source 97 53 
16. IBıg Vacillation 40 2.7 
17. 1C1 Economic pressure oo 65 
ША Symbolic 1.3 10.7 
III I Alternate Themes 65 13 
HI G Autobiographical 53 8 
HI J Comments about artistic 

merits of picture, etc. 67 оо 


E. Identification of Characters and Objects 
1. Individual climbing in window 8.0 оо 


т Card 15 
A. Emotional Tone Rating 
—23 —1 о +! +2 
H 5 37:3 9:3 0.0 0.0 
NH 12.7 4.0 0.0 0.0 


Most frequent story is extremely sad. Neutral 
stories are rare. 


B. Outcome 
—  —1 о + +2 ? 
H 20.0 160 22.7 16.0 40 280 
NH 45-9 93 178 5.9 40 10.7 
Most frequent outcome is sad. 
Hospitalized subjects have significantly more 
questionable outcomes. 


C. Shift 
Happier Sadder No Shift 
H 40.0 4.0 56.0 
NH 22.7 10.7 66.6 


Less than half the subjects shift. 
Significantly more shift to happier than 
sadder conclusions. 


D. Most Frequent Themes 


H NH 
1, IAsj Death or illness of partner 45.3 32.0 
2. 1B16 Religion 453 29.3 
3. IAşj Death or illness of peers 18.7 21.3 
4. 1B17 Loneliness 14.7 10.7 
5. 184 Behavior disorder 53 16.0 
6. 183 Reminiscence, happy 107 4.0 
7. 1B; Guilt—remorse 67 80 
8. IBg Fear 67 6.7 
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9. IA1j Death or illness of child 27 80 11. IB2 Inadequacy 53 13 
10. IC6 War 53 53 12. 1Bio Rumination 7 40 
11. IA3j Belongingness оо 10.; 13. ICs Aggression from environment 2.7 40 
12. IAge Aggression towards peer 40 65  141C3 Generalized restriction 13 53 
1g. IA1i Death of parent 53 27  15.llAid Reunion with parents 27 40 
14. IAgb Succorance from peer 53 92 III G Autobiographical 24.0 13.8 
15. IBio Rumination 27 53  *HIL Rejection 173 13 
16. IA4k Death or illness of sibling 40 2.7 *1 Н Continuation of previous 
17. 183 Curiosity оо 6; story 19 10.7 
18. IC4 Aggression towards environ- HIC Description 67 18 
ment 27 40 * Difference is significant at or beyond .o5 
III D Unreal 107 173 level. ИЕ 
HII Alternate Themes 67 93 түне 
ПІ J Comments about artistic pu 1; BM : 
merits of picture 93 27 A. Emotional Tone Rating 
ША Symbolic 53 53 —2 —1 о +1 +2 
HI p Abstract 2 л H 24 24.0 38.7 30-7 4.0 
IH N Confused 67 oo NH Lg 29.9 26.7 33-3 9:3 
Most frequent story is neutral to moderately 
E. Identification of Characters and Objects happy. 
1. Misidentify sex of character 4.0 40 B. Outcome Rating 
—2  —1 o 1 2 ? 
Card 16 H 80 Во 307 ES do 16.0 
A. Emotional Tone Ratings NH 67 по 240 38.7 10.7 4.0 
=e pre o БЕЯ +2 Most frequent outcome is moderately happy. 
H 23 17-3 38.7 20.0 5-8 Significantly more hospitalized subjects con- 
NH 8.0 25.3 25.3 24.0 12.0 tribute questionable outcomes. 


Most frequent story is neutral. 
Moderately happy and moderately sad stories 
are of equal frequency. 


B. Outcome Ratings 
—2  —1 o + +2 ? 
H 4.0 40 387 24.0 4.0 9-3 
NH 8.0 80 307 25.3 20.0 2.7 
Most frequent outcome is neutral. 
Significantly more non-hospitalized subjects 
contribute extremely happy stories. 
No normal subject has a questionable out- 
come. 


C. Shift 
Happier Sadder Мо Shift 
218 9-3 69.4 
NH 28.0 10.7 61.3 


About two thirds of the subjects do not shift. 
Of those that do, significantly more shift to 
happier conclusions. 


D. Most Frequent Themes 


H NH 

1. ПС1 Favorable environment 12.0 12.0 

2. IC6 War 8o 145 

3. IIA2c Contentment with partner 8.0 ES 

4. HB2 Tranquility 53 8o 

5. IIB7 Ordinary activity 6.7 53 

6. IIAgbCongeniality with peer 67 ee 

7. IIA2c Ordinary Activity 2:7 Ea 

8. IA1a. Pressure from parents 2.7 d 
9. IAif Departure from parents зл иб 
10. IAgb Ѕиссогапсе from peer 40 эл 


C. Shift 
Happier Sadder No Shift 
H 18.7 14.7 66.6 
NH 21.3 16.0 63.7 


About one third of the subjects shift and of 
these there is no significant difference in thosc 
shifting to happier or sadder conclusions. 


D. Most Frequent Themes 


H NH 
1. ПВІ, Self esteem 14.7 307 
2. HAge Exhibition 16.0 22:7 
3. HAgi Competition with peer 12.0 187 
4. IC7 Escape from peril 173 17: 
5. 1В3 Curiosity 16.0 80 
6. IB1 Aspiration 147 5% 
7. 1С2 Legal restriction 120 80 
8. ПАЗаАрргођацоп from peers ovy 229 
9. IB18 Compensation б 40 193 
10. HB2 Tranquility до 80 
11. IIB6 Resignation to lot їз 10:0 
12. IC5 


Aggression from impersonal 
source бт 4^ 


13. IB2 Inadequacy 40 53 
14. IBiz Illness or death of central 

character arg, 90 
15. IC6 War 27 53 
16. IIB7 Ordinary activity 40 40 
17. IBi4. Vacillation ag 40 
III Alternate Themes 147 49 
Ш С Autobiographical 53 53 
Ш A Symbolic ap dn 


= 


an AT Ж 


— V^": 
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E. Identification of Characters and Objects 


H NH 

1. Character is going up 46.7 40.0 

2. Character is going down 22.7 29.3 
3. Uncertain if character is going up 

or down 40 27 

4. Character is going up and down 53 13 


F. Attention to Details 


1. Note. muscles 26.7 24.0 
. Note nudity 13.3 8.0 


ю 


Сата 18 ВМ 
А. Emotional Tone Ratings 


А —2 —1 o 2 
H 17.3 64.0 10.7 0.0 
NH 24.0 747 5-8 оо 


Most frequent story is moderate 
Neutral and moderately happy stories are un- 
usual, 
B. Outcome Ratings 


o +1 +2 ?‏ ود 
H 21.3 26.7 253 0.0 173‏ 
NH ojo 227 227 227 оо — 80‏ 


All categories of outcome are about equally 
frequent except questionable. ui 

Of questionable outcomes in non-hospitalized 
group 40% are contributed by psychoneurotics. 


C. Shift 
Happier Sadder No Shift 
H 40.0 4.0 66.0 
NH 36.0 10.7 63.3 


More than half the subjects do not shift. of 
those that do, significantly more shift to happier 
Conclusions, 


D. Most Frequent. Themes 


H NH 
1. 188 Drunkenness 37-3 493 
2. ТАЗ Succorance from peers 38.7 387 
3. IAga Pressure from peers 21.9 26-7 
„£ IB4 Behavior disorder 14.7 253 
5: IC2 Legal restriction 28.0 120 
б. Aga Aggression from peer 13.3 173 
7.1812 Illness or death of central 
character 173 1938 
А 1С4 Aggression towards imper- 

sonal object 18-8 n 
s 183 Guilt—remorse 107 40 
n. 1518 Retribution 147 27 
NS Aggression from impersonal a es 

1 Source ў 9 
t Tum Hypnotism 40 5 
n ма Pressure from parents 4.0 = 
15 ne Pressure from partner E c 
16 I Ы Inadequacy 2 53 

130 Fear 27 4 
[5 40 27 


Occupational concern 


III I Alternate Themes 13.3 14-7 
ША Symbolic 5g 67 
шр Unreal Б 40 эй 


* Difference is significant at or beyond .о5 
level. 


E. Attention to Details 
1. Dwell on. number of hands 18.7 173 


Card 19 

A. Emotional Tone Ratings 
=й о +1 +2 
H 27 24.0 41.3 16.0 0.0 
NH 9.3 33-3 14.7 33-3 2.7 

Most frequent story is neutral to moderately 
sad. 

Significantly more hospitalized subjects give 
neutral stories, 


1 0 


B. Outcome Rating 

+1 2 ? 

H 2.7 2.7 $20 21$ о.о 25. 

NH 18.7 193 307 21.9: 93 6. 
Most frequent outcome is neutral. 
Hospitalized subjects give significantly more 

questionable outcomes and significantly less sad 

ones. 


—2 —1 о 


a б 


C. Shift 
Happier Sadder No Shift 
H 17.3 6.7 76.0 
NH 20.0 24.0 66.0 


More than two thirds of subjects do not shift; 
and of these that do, there is no significant dif- 
ference between the number giving sadder and 
happier conclusions. 


D. Most Frequent Themes 


H NH 
1. 1C5 Aggression from impersonal 
source боо 61.3 
2. ПА1Ё Contentment at home 20.0 26.7 
3. IBi9 Vacillation 8.0 8.0 
4. IBg Fear 9.7 104 
5. IB12 Physical illness or death of 
central Character оо 6.7 
6. 184 Behavior disorder оо 8.0 
7. 1816 Religion 40 27 
8. IC6 War 1:35 Б 
шр Unreal 13.3 25.3 
*HI С Descriptive 14.7 ay 
*HI L Rejection 1.3 12.0 
ш G  Autobiographical 27 65 
HII Alternate themes 93 2.7 
III B Abstract 27 6.7 


* Difference is significant at or beyond .o5 
level. 
E. Identification of Characters and Objects 


1. House in snow storm 440 547 
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2. Ship in storm 8.0 93 
3. Igloo or quonset hut 40 4.0 
4. House in flood &o 1.3 
5. Ghosts and "spooks" 1⁄3 12.0 
Card 20 

A. Emotional Tone Ratings 

—2 —1 о 4+1 +2 
н 6.7 41.9 34-7 2.7 о.о 
NH 13.3 747 9-3 2.7 0.0 


Most frequent story is moderately sad. 

Non-hospitalized subjects contribute signifi- 
cantly more sad stories and significantly less 
neutral ones than the hospitalized subjects. 


B. Outcome Ratings 
ex = o +! 42 ? 
H 93 107 360 240 оо 17.3 
NH M 173 213 333 40 93 
Most frequent outcome is neutral to moderate- 


ly happy. 
C. Shift 
Happier Sadder No Shift 
H 33-3 8.0 58.7 
NH 48.0 10.7 41.3 


About half of all subjects shift, significantly 
more to happier conclusions. 


D. Most Frequent Themes 


H NH 
1. IBig Vacillation 80.0 70.7 
2. IC1 Economic pressure 17 24.0 


3. 1С5 Aggyassion from impersonal 
source 13.3 16.0 
4. IC4 Aggression to impersonal ob- 
ject 10.7 8.0 
5. 188 Drunkenness 93 93 
б. IB11 Occupational concern 59 120 
7. lÀgj Belongingness 24 133 
8. 1B17 Loneliness 6.7 10.7 
9. IIB7- Ordinary activity 93 67 
10. IAgb Succorance from peer 67 80 
11. 182 Inadequacy 53 93 
12. 183 Curiosity 93 53 
13. IAgg Unrequited love 8o 5:3 
14. IC2 Legal restriction 93 49 
15. IC6 War 8o 58 
16. IAgh Illicit sex 2. 9-3 
17. IBio. Rumination 40 80 
18. 187 Guilt-remorse 53 49 
HI I Alternate themes 21.3 10-7 
HIA Symbolic 40 40 
Ill J Comments about artistry 2/ 40 
HI G Autobiographical 53 13 
E. Identification of Characters and Objects 
1. Confusion. of sex 
a. Misidentification immediate 6.7 147 
b. Uncertain at first and then mis- 
identify oo 53 
c. Uncertain at first and then iden- 
tify correctly oo 27 
d. Unable to decide whether male 
or female 10.7 09 


2 
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L INTRODUCTION 


D URING the past decade many new per- 
sonality tests have been devised, 
little-used old tests have been revised, 
popular tests have been polished, re- 
fined, and expanded. College catalogues 
throughout the United Síates have 
lengthened lists of .courses offered in 
psychology to include instruction in the 
administration and interpretation of per- 
Sonality tests in general, of “projective 
techniques” in particular, and of the Ror- 
schach Ink Blot Test specifically. 
Literally hundreds of research reports 
and observational articles have been pub- 
lished on the Rorschach and its clinical 
use. The bulk of this literature has been 
concerned with Rorschach patterns as 
they relate to psychiatric syndromes, to 
social or cultural groups, to successful 
and unsuccessful students, workers, or 
military groups. The practical or em- 
Pirical success of the instrument has been 
widely reported. The operational results 
of the Rorschach as a diagnostic instru- 
ment have been repeatedly acclaimed. 
There is, however, a dearth of labora- 
tory investigation of this important and 
Widely-used clinical tool. In fact, there is 
€ven a question as to whether the test ac- 
tually is a projective technique. Schach- 
tel (31) offers an excellent critique of 
the projection hypothesis of the Ror- 
Schach test, and Bellak (3) states flatly 
that Rorschach’s test is not a projective 
method. More clarification of these philo- 
Sophical and theoretical areas underlying 
the Rorschach is definitely indicated. | 
Too, the paucity of statistical studies 
Of reliability and validity, or of adequate 
Norms, plus the existence of contradic 
tory reports on these crucial points has 
Siven rise to a diapason of criticism. This 
Cacophony has been particularly painful 


to the ears of the conscientious clinical 
psychologist who daily uses the Ror- 
schach for diagnosis, for recommenda- 
tions for therapy, for vocational or edu- 
cational guidance and counseling, etc. 

As Hsü (16) points out, there are two 
popular and disparate attitudes toward 
the Rorschach held by equally psy- 
chologically sophisticated persons. One 
maintains that the prime importance of 
the instrument is its revelation in sym- 
bolic terms of the personality of the in- 
dividual, considered as a unique uni- 
verse; the other stoutly maintains that 
this test, like all other psychological meas- 
uring devices, should meet at least mini- 
mnm standards of statistical reliability 
and validity. 

Hunt (17) criticizes the over-emphasis 
in clinical psychology on quantitative 
data rather than qualitative, and bases 
his hopes for the future progress of clini- 
cal testing on an increased attention to 
qualitative behavior of the person tested 
rather than on quantitative results of the 
test. 

Qualitatively, does a subject, in fact, 
satisfy one of the basic assumptions un- 
derlying all projective techniques—that 
his concepts reveal his consistent way of 
organizing experience, as measured by 
the ideas and feelings he projects into 
"meaning-free" or ambiguous stimuli? Is 
a subject reacting only to the “unstruc- 
tured” or “semi-structured” ink blots on 
the Rorschach test, or is he reacting to 
the total perceptual and social field, 
which includes the administrator of the 
test and the affective tone of the ad- 
ministrative situation? If these latter two 
variables are altered, will the subject's 
perception of the ink blots vary signifi- 
cantly? Can a subject conceal his way of 
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organizing experience or simulate a pat- 
tern not his own? Can an administrator 
induce an altered Rorschach pattern by 
explicit or implicit suggestion? 


There have been some very interesting explora- 
tions of these last two questions. Fosberg (11) 
studied attempts to fake Rorschach results, and 
also (12) examined protocols obtained under 
varied instructions. He demonstrated that psy- 
chologically sophisticated subjects, given a neu- 
tral or control Rorschach, could not conceal their 
basic test patterns on two subsequent tests in 
which they tried to give the best, then the worst, 
possible impression. 

Schachtel (30) emphasizes the importance of a 
subject's personal definition of the test situation 
and its demands. He also strongly presents his 
personal conviction that the three most common 
subjective definitions of the Rorschach situation 
are as authoritarian, competitive, or resistance 
situations. These definitions, he maintains, in- 
fluence Rorschach results. 

Levine (22, 23) reports a study of a series of 
protocols produced by the same subject while 
experiencing different situations hypnotically 
induced. There were marked changes in the 
Rorschach rccords of the various situations, each 
taking on the coloring considered characteristic 
of a person experiencing the situation created 
by hpnotic suggestion. 

Luchins (27) studied the influence on Ror- 
schach responses of situational and attitudinal 
factors, Among his findings was the marked 
need for the development of methods which 
would prepare a subject for the Rorschach test. 
A significant factor in the Rorschach results he 
analyzed proved to be misunderstandings of 
directions and subjects’ hazy concepts of what 
was expected of them in the testing situation. 

Wilkins and Adams (33) studied results of 
Rorschachs obtained under hypnosis, under so- 
dium amytal, and in the normal, waking state. 
'Тһеу found that both the chemically and psy- 


chologically ind:iced abnormal states contributed 
to increased productivity on the Rorschach 
among subjects described as "overly cautious" or 
"fearful." 

An interesting variation in the sort of study 
conducted by Levine is that of Bergman, Graham, 
and Leavitt (4) who reported the Rorschach 
records of a hypnotically regressed patient, diag- 
nosed as a conversion hysteric. They found that 
the protocols varied with the level of hypnouc 
regression but closely followed the expectancies 
growing out of other clinical data. 

Cofer (9) studying the changes in responses 
given by a patient with organic involvement 
while under hyoscine, found that some of the 
cards are more sensitive than others to situa- 
tional variations. 

A recently published research project of the 
Army Air Forces (32) included an analysis of 
the influence of the examiner on the number of 
responses obtained on Rorschach records. of 
36 t-ratios between the means of nine examiners, 
12 were significant at the 1 per cent level of 
confidence and three at the 5 per cent level. 
Clearly this one scoring symbol (R)! would seem 
to vary with the examiner, 


Keeping this last finding in mind, and 
hypothesizing a continuum between the 
findings of Levine and Fosberg, one won- 
ders whether the non-hypnotized subject 
ever experiences attitudes toward the ad- 
ministrator or mood reactions to the total 
administrative situation which are ade- 
quate to alter the determinants of his 
Rorschach record significantly. This, in 
brief, is the problem which led to the de- 
velopment of the present study. 


1 ý š " am- 
An explanation of the Rorschach scoring Sy?" 
bols used in this study is given in the Appenc IX’ 
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IL. THE PROBLEM 


TATEMENT OF THE PROBLEM. This 
S study of the influence of negative 
and positive rapport conditions on Ror- 
schach performance is based on the hy- 
pothesis that the perception of ambigu- 
ous or "unstructured" stimuli is not in- 
fluenced by the perceiver's affective re- 
action toward the total stimulus field, 
which includes the administrator—the 
person presenting the "unstructured" 
stimuli. If an experimental test proves 
this hypothesis false, implications are 
legion for the interpretation of test re- 
sults and for the training of administra- 
tors, 

Of importance is the possibility that 
this study—a test of the foregoing hy- 
pothesis—may reveal whether or not the 
administrator is a significant part of the 
stimulus field during the administration 
of projective tests. The theory behind 
projective testing is that the subject struc- 
tures stimuli presented to him in non- 
structured or semi-structured form, If a 
subject is projecting meanings, ideas, 
and feelings into ambiguous stimuli un- 
influenced by the person presenting the 
Stimuli or by the affective tone of the 
administrative situation, there should be 
No significant variation in.his structuring 
of the stimuli if either the administrator 
Or the affective tone of the administra- 
tion is varied. E 

The problem of transference arises in 
all therapies, regardless of type, and the 


findings of this project might conceivably 
~ 

' Cognizance is taken here of the fact that one 
Cannot amplify specific findings related to the 

Orschach ink blots into generalizations con- 
аз a category; how- 
light on the varia- 
tive technique 


throw some needed light on the subject 
of transference-elements. If the require- 
ments for positive transference could be 
determined or the situations in which it 
flourishes could be measured, the selec- 
tion and training of therapists could be 
more clearly defined. It is thought that 
the present study may hold implications 
for this related problem. 

Outside the realm of clinical psychol- 
ogy, this study may have implications for 
social psychology. Too, the manner in 
which inter-personal relationships affect 
perceptual processes is of great current 
interest to systematic psychology (14) as 
well as to the field of clinical psychology. 
While the present study does not answer 
the questions here posed, it does provide 
implications for future research in thesc 
areas. 

Within the framework of the thera- 
peutic situation, there may be at work 
individual variables which cause one sub- 
ject to function more "normally" in a 
threatening environment, another to 
function more productively or adequate- 
ly in an environment of heightened ac- 
ceptance or permissiveness. In so far as 
the therapist's personality may structure 
these mood-situations, the results of this 
project could be revealing. 

Designing an experiment to test 
whether Rorschach records will vary with 
experimental variation of administrator 
and tone of administration, one must ask 
himself, from a theoretical point of view, 
what variations might be expected. Re- 
membering the experiments of Levine, 
one might hypothesize radical shifts in 
protocols from one administration to an- 
other if the administrations were ade- 
quately loaded with different affective 
tones. Remembering the experiments of 
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Fosberg, wherein psychologically sophisti- 
cated subjects proved incapable of suc- 
cessfully simulating the Rorschach rec- 
ords of personality-types other than their 
own, one might predict no significant 
variation in total protocol, regardless of 
administrator or of type of administra- 
tion. + 

Somewhere between these extremes, 
one may ask what sort of variation might 
be expected (a) if a group of subjects 
were given control Rorschach adminis- 
trations, later were given positively 
loaded? Rorschach administrations, and 
still later negatively loaded administra- 
tions; (b) if a group of subjects were 
given Rorschachs by three different ad- 
ministrators at intervals of four to six 
weeks. Following are the hypotheses 
which seem to follow from the cited rele- 
vant studies, and which seem reasonable 
from the standpoint of personality dy- 
namics and the Rorschach symbols or 
projective response-categories which are 
widely believed to be related to these 
dynamics. 

1. There will be a variation in the 
number of responses elicited by the vari- 
ous administrators, regardless of affec- 
tive variable or sequence of administra- 
tion. This hypothesis is based on the 
A.A.F. study, cited above. 

2. There will be no consistent varia- 
tion in number of responses between the 
first, second, and third administrations; 
i.e., no increase or decrease in number 
of responses with successive administra- 
tions. However, the variation in number 
of responses will be related to a subject's 
initial performance. If a subject gives 
thirty or fewer responses on his first test, 
he will increase the number with subse- 


*'The word loaded is here used to mean 


weighted or slanted. See IV, C for complete 
exposition of this term as herein employed. 


LORD 


quent testings; if he gives seventy or 
more on the first, he will decrease his re- 
sponses on subsequent testing.” 

3. Even among “normal” subjects, the 
alterations attendant upon variations of 
affective tone of administration will be 
largely dependent upon individual dif- 
ferences, the more stable personality rec- 
ords showing less change, the more labile 
records showing more change. It is рге- 
dicted that approximately a third of the 
subjects' records will show no differences 
or only minimal changes regardless of 
administrator, order of administration, 
or experimentally varied affective tone of 
administration; a third will be extremely 
variable, fluctuating with administration 
order, administrator, and tone of ad- 
ministration. This stability or lability 
will be reflected in the constancy or shift 
of the M : sumC ratio. 

4. Regardless of the order of adminis- 
tration or the affective loading, there will 
be some variation in the records ob- 
tained by the three administrators which 
will be a function of the different per- 
sonalities of the administrators. Since 
this factor is an unknown variable, it i$ 
not possible to predict the anticipated 
variations specifically. This hypothesis 15 
a logical outgrowth of the known varia- 
tion of the number of responses with 
examiner differences (32). Responses are 
scored as to location areas and determi- 
nants. An increase in responses insures 
an increase in one or another of these 
scoring categories, Furthermore, since 
number of responses has been shown to 
vary with examiner differences, it is not 


— 
* Since thirty to Seventy responses is usually 
Considered the normal range, this hypothesis 
is based on an anticipation of the operation 
of central tendency. 
j т " š 
The particular choice of thirds in this hY 


pothesis is based on verbal prediction made By 
Dr. Bruno Klopfer. 
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unreasonable to hypothesize that other 
Scoring categories of the Rorschach may 
be equally sensitive to administrator dif- 
ferences. 

5. Regardless of the administrator or 
the sequence of administration, there 
Will be variations in the protocols pro- 
duced under the three different adminis- 
trative situations: neutral or control, 
positive affective loading, negative affec- 
tive loading. Specific variations antici- 
pated are the following: 

a. Negatively loaded administration. | 

(1)- Greater scattering of response determinants 
among the signs К, К, Fe, and C', indicating a 
withdrawal from the painful situation (k or K), 
à depression of a situational sort (C^, and a sen- 


sitive approach to the total stimulus environment 
(Ес) as a way of handling anxiety provoked by 


the situation. These signs will tend to increase at 
the cost of the movement and color responses. 
Fm will appear with greater frequency as an in- 
dicator of tension in the face of a threatening 
situation. 

(2) There will be an increase in white-space 
responses (S) as an indication of negativistic or 
oppositional tendencies. 

(3) There will be an increase in F% and in 
FK + F + Fc/R as a reflection of an increased 
effort to master the situation through intellectual 
control or through this control supported by 
heightened sensitivity and introspection. 

b. Positively loaded administration, 

(1) There will be a greater emotional expres- 
sion, or emotional relationship with the environ- 
ment in this more permissive situation; this will 
be reflected in an increase in sumC. 

(2) There will be an increase in movement 
responses, M and FM, indicating a wider play of 
creativity and a greater freedom of expression of 
“Id-drives.” 


III. PROCEDURE 


N EXPERIMENTAL design was con- 
A structed which aimed toward con- 
trolling as many variables as possible. 
The basic unit of the study was 36 males 
between the ages of 19 and 27. They were 
selected at random from a list of approxi- 
mately 200 college sophomores enrolled 
in an introductory course in general psy- 
chology. Each subject was given three 
Rorschachs at four to six weeks’ inter- 
vals. Each of the three Rorschachs was 
administered by a different person. 


The three administrators, A, B, and C, were 
females, not different in any grossly apparent 
мау; i.e, no one of the three was outstandingly 
fat or thin, ugly or beautiful. Administrators A 
and C were brunettes, B was a reddish-blosde. 
The age range among the three was twelve years, 
with B the youngest and C the oldest. Each ad- 
ministrator holds a degree in psychology and has 
had several years’ experience in the use of the 
Rorschach according to the Klopfer technique. 

The Rorschach tests were given in a systematic 
or rotated order so that twelve subjects received 
control tests first, twelve second, and twelve 
third. Twelve received negative tests and twelve 
received positive tests first. The second and third 
testing sessions were similarly varied. 

No subject was tested twice by the same per- 
son. Twelve were tested first by Administrator 
A, second by B, and third by C. The other ad- 
ministrations were distributed similarly. Each 
administrator used exactly the same technique 
for the administration of the Rorschach Test, 
regardless of the affective loading of the pre- 
test situation and regardless of whether the ad- 
ministration was the first, second, or third for 
the subject. 


The negative and positive situations 
included two simple card-sorting tests ad- 
ministered prior to the Rorschach test 
for the purpose of setting the affective 
tone of the session. Identical directions, 
terminology, and techniques were used in 
both the negative and positive pre-tests; 
however, the tone of the administrations 
was differently slanted. Specific directions 
for the administration of the three ses- 


sions were followed explicitly and with- 
out variation by the three administrators. 
These directions were as follows: 


ADMINISTRATION 


Neutral. Standard Rorschach procedure will be 
used. The administrator will be courteous but 
business-like in manner. She must attempt to 
avoid either negative or positive affective loading 
of the situation. There will be no pre-tests nor 
any gathering of biographical data. 

Negative administration. The administrator 
will assume the role of a harsh, rejecting, au- 
thoritarian figure. She must be deliberately un- 
concerned about the subject, not look at him 
while asking questions, preparing tests, or giving 
directions; never smile, give directions in a voice 
of dictatorial harshness, make every “Hm!” 
sound like a sneer, 


Positive administration. The 
will be personall 


administrator 
` Y warm, charming, appreciative 
in manner. She must look at subject with а 
smile while asking questions, preparing tests, OT 
giving directions in an encouraging tone of voice, 
making every "Hm!" sound like a compliment 
for work well done. 

E he following administration will be used 
verbatim for both negative and positive adminis- 
trations: 

Are you Mr. Blank? 
room. Sit here, 


Come with me! This 


What is your full name? You have an address? 
eshman, Sophomore, Junior? Your birthdate? 
Are you familiar with an ordinary deck of 
ying cards? (Place deck face down, with one 
k nine on the bottom of the pack.) When 1 
say GO turn the cards, one at a time, and pull 
out all the black nines and the red tens, Put the 
black nines here and the red tens here (indicate 
areas with gestures). Work as fast as you сап. 
Ready? Go! (Answer any questions only by 
repeating words in the foregoing instructions. 
Note any cheating, i.e., number of times subject 
turns two or more cards at a time; also note any 
errors in carrying out instructions or any requests 
for repetition of instructions.) Record time. 

ane Some Speed! (Leave nines and tens: 
ia {еу О cards; replace, face down, be- 

J ч 
à m p I say GO, turn the cards, one at 
E € See how many face cards— Jacks: 
iun i Kings—you can pull out before n 
ud "t. the red face cards on the black 
ra Hie black face cards on the red tens. 
eady? Go! (Allow thirty seconds.) 


Fr 


- 


а 


EXPERIMENTALLY INDUCED V. 


STOP! (Count face cards and yecord number.) 

Hm! Not bad! 

(The Rorschach Test will follow using the 
Same instructions as for the neutral administra- 
lion.) 


Аз the experimental design reveals, 
each subject follows a unique pattern. 
Consequently, if a subject were lost at 
any stage of the experiment, а substitute 
Would have to be started from the very 
beginning. In order to insure completion 
of the design, the entire pattern was 
Started in duplicate; i.e., seventy-two sub- 
jects were given first Rorschachs. Despite 
there were several cases 


this precaution, 
lost, and a 


wherein both of the pair were 
third subject (with his parallel substitute) 
was started through the design. The prin- 
cipal cause for loss appeared to be the 
subject's reactions to the negatively ad- 
ministered session. Following this session, 
the subjects seemed more difficult to con- 
tact for testing. appointments and fre- 
quently broke appointments or failed to 
appear as scheduled. Thirteen of the orig- 
inal seventy-two subjects were dropped 
following the negatively administered 
test because they had failed to keep from 
three to five appointments for the subse- 
quent test. Two subjects were dropped 
for the same reason following positively 
administered sessions. Eight of the thir- 
teen were given negatively loaded tests 
during the first session, five during the 
second. The losses, by examiner, were ap- 
proximately equal, two losing four sub- 
jects each, the third losing five after nega- 
tive administrations. The only reason for 
dropping these subjects, rather than per- 
sisting until an appointment was kept, 
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was the time factor. If a period longer 
than six weeks elapsed between sessions, 
the subject was dropped and his parallel 
substituted in the pattern. 

With this evidence in mind, one may 
speculate that the differences between 
negative results and positive or neutral 
results might have been greater if 
methodology had permitted retention of 
the negative records of the subjects who 
apparently rejected the entire’ project 
after experiencing the negative adminis- 
tration. 

A few subjects were lost for other rea- 
sons. Three moved from the community 
before completing the three sessions; two 
were ill between sessions and unable to 
meet the requirement of a maximum 
time-lapse of six weeks between tests. 

In every other case, testing of parallel 
partners was dropped as soon as the prin- 
cipal subject had completed his pattern 
and thereby precluded the need for a 
substitute. 

As Rorschach records were accumu- 
lated, they were coded and mixed. The 
one hundred eight records were then 
scored without reference to subject, name 
of administrator, type or order of ad- 
ministration. Only main responses and 
determinants were used in the analyses 
of the records for two reasons: (1) in- 
corporation of additional responses and 
determinants would unprofitably over- 
complicate the study; (2) there are 
marked differences among exponents of 
the Rorschach method (e.g., Beck, Buh- 
ler, and Klopfer) as to the most desirable 
use to make of these additional deter- 
minants in scoring and in interpretation. 


IV. RESULTS OF THE EXPERIMENT 


N ADDITION to the loss of subjects 

following negative administrations, 
there is other evidence that the negative 
situation was actually traumatic to the 
subjects. Although the very simple card- 
sorting pre-tests were designed merely to 
set the administrative tone of the ad- 
ministration for the Rorschach, behavior 
on the pre-tests suggests that in many 
cases the subjects were reacting different- 
ly by the time the pre-tests were intro- 
duced. The initial greeting, without a 
smile or a direct look, plus the manner 
and tone of voice in which the four per- 
sonal-data questions were asked, ap- 
parently caused the subjects some con- 
cern. 

A. BEHAVIOR ON Pnr-Trsrs — 


Not a single subject on the positively- 
loaded administrations misunderstood 
the pre-test directions or made an error 
in carrying them out. Twelve of the 
thirty-six subjects made from one to six 
errors on the negative administrations, 
with a total of twenty-five errors, Fail- 
ure to understand directions was counted 
as an error if the subject requested repe- 
tition or clarification. If he demonstrated 
his confusion through incorrectly carry- 
ing out the directions, he was not cor- 
rected, but the error was noted. 

During the Rorschach administration, 
the writer recorded every audible sigh or 
audible laugh. The laughs and sighs 
elicited by this one administrator give 
further qualitative evidence that the sub- 
jects were, in fact, reacting differently to 
the different affective loadings of the ad- 
ministrations: Eleven of the thirty-six 
subjects sighed aloud from one to seven 
times (total of thirty-two sighs) during 
the free-association period of the Ror- 
schach, i.e., during the initial presenta- 


tion of the plates. One subject sighed 
aloud during the positive, two during the 
neutral, and eight during the negative 
administrations. Seven subjects laughed 
aloud once or twice with a total of nine 
audible laughs; three laughed during the 
positive, two during the negative, and 
two during the neutral administrations. 
No effort was made to tabulate smiles 
and merely visible sighs. 

One observes that audible sighs occur 
with greater frequency during a cold or 
rejecting situation than during a warm, 
permissive situation or a neutral situa- 
tion, but that laughing aloud occurs with 


approximately equal frequency during 
the three different situations, 


B. VARIATION IN NUMBER OF RESPONSES 


: The hypothesis concerning successive 
increase or decrease of number of re- 
sponses as a function of the number 
given on the initial test held well for two 
cases which initially gave more than 
seventy responses but failed to hold con- 
sistently for twenty-six cases giving fewer 
than thirty responses at the outset. 
There is an apparent bias in testing Te 
sponse limits exceeded by such an um- 
even number of subjects as exceeded our 
limits; ie, two versus twentysix. The 
explanation of this procedure rests in the 
earlier hypothesis based on erroneous 
original expectancy as to number of re- 
Sponses. Rorschach literature quite gen- 
erally states that thirty responses can be 
considered the minimum expectancy 
with normal, intelligent, adult subjects, 
seventy responses the maximum expe 
tancy. The failure of our subjects to con 
form to this expectancy is discussed later 


under the subheading, “Number of re- 
sponses.” 
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C. EnLEBNISTYP 

Hermann Rorschach's theory of per- 
sonality rests on the foundation of what 
he called Erlebnistyp, a word which has 
been variously translated but which is 
consistently rendered Experience Type 
in the English translation of Rorschach's 
original paper. He (29) makes reference 
to the problem, so labeled, throughout 
his Psychodiagnostik, and insists that an 
individual's protocol should be inter- 
preted within the framework of his 
Erlebnistyp, that is, his introversive, ex- 
tratensive, or ambiversive orientation, his 
"experience-balance" in responding to 
stimuli from within and/or stimuli from 
without. Rorschach found that Erlebnis- 
typ was reflected in the ratio between 
movement and color responses (M : 
sumC). 

The scoring symbol M denotes, ac- 
cording to Rorschach, “Form perceptions 
plus kinaesthetic factors"; (29, p. 25) and 
he adds that “The more kinaesthesis, the 
more stable the affectivity” (29, p. 26). 
"Color responses," he continues, "have 
proved to be the representative of the 
affectivity and the rule is, the more color 
in the test, the greater the emotional in- 
stability of the subject" (29, р. 76). 

A person with more M than C in his 
record would be designated the M-type 
(introversive), described by Rorschach 
(29, p. 81) as having the following char- 
acteristics: 

1. Predominance of personalized pro- 

ductivity. 

. Intensive rapport. 
3. Stable affect and motility, 


insufficient adaptability to 
nt extensive 


nj 


awkward- 


ness, 

reality and insufficie 
rapport. 

A subject with m 

be designated the C-type; 

sive than introversive. The 8 


ore € than М would 
more extraten- 
eneral char- 


acteristics of this type, Rorschach (29, p. 
83) lists as follows: 

1. The urge to live in the world out- 

side oneself. 

2. Restless motility. 

3. Unstable affective reactions. 

Klopfer (21) has pointed out the neces- 
sity of taking into consideration other 
factors in the record before venturing 
more than a simple statement of the sub- 
ject’s “natural inclinations” as reflected 
in his M : sumC ratio. These additional 
considerations give information as to 
whether the individual follows these in- 
clinations or is in a state of conflict over 
them. 

For the purpose of the present study, 
we may confine ourselves to an analysis 
of the extent to which lability and sta- 
bility of the M : sumC ratio exists within 
the framework of our experimentally 
varied testing situations. 

Rorschach recognized the existence of 
temporary variations in number of M 
and С in a subject's record if the mood 
of the subject shifted between dejection 
and elation; however, he stated that de- 
spite the variation in absolute number, 
“the proportion between them (M and С) 
changes little or not at all" (29, p. 94). 

In the 108 records produced by our 
thirty-six subjects, variation in absolute 
number ranged from 0 to 19 on the M 
side and from o to 13 on the sumC side. 
The proportion between the M and C 
was exceedingly shifty; and, in addition, 
there were numerous actual shifts of 
weight in the ratio from one side to the 
other. The number and direction of 
these variations in Experience Balance 
are summarized in Table I. 

From Table I it can be seen that the 
greatest stability of M : sumC ratio exists 
between administrations I and II; even 
here, 31 per cent of the cases shifted from 
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one balance to another; ie., there was 
variation within approximately a third 
of the persons tested, as measured by an 
individual's production of more move- 
ment than color responses in one situa- 
tion, more color than movement re- 
sponses in another situation. A close sec- 
ond in rank order of stability is that be- 
tween records obtained by administrators 
B and C, wherein 67 per cent maintained 
a constant balance. 

The greatest number of shifts in bal- 
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the experimeat regardless of sequence, 
administrator, or type of administration. 
Such a conclusion cannot be drawn [rom 
Table I, however, since stability therein 
is noted within, but not between, the 
variable situations. 

Ап examination of the data reveals 
that only fourteen of the thirty-six cases 
maintained a constant balance on the 
M: sumC ratio throughout all variations 
in testing situations, Of these fourteen, 
ten held a balance weighted on the M side 


TABLE I 
NUMBER AND DIRECTION OF SHIFTS IN BALANCE ON M: sumC Ratios 


Frequency and Direction of Change 


Situation % % 
MtoC CtoM A*toM AtoC MtoA CtoA Stable Stable Unstable 

I to II 1 2 ° $^ 4 I 2 6 I 
I to Ш 5 4 2 3 3 o i js A 
II to III 6 6 I 4 H o 18 х5 50 
Ato B 1 2 1 2 4 2 18 50 50 
Ato C 8 2 3 2 I 1 19 5 47 
B to C 2 4 2 4 o o 24 25 33 
oto + 2 5 2 І 2 4 20 56 "m 
+ to = 4 5 3 2 2 ° 20 56 44 
oto — 2 7 1 2 1 2 21 58 42 

Totals 37 37 15 23 18 10 184 57 A3 


* The symbol A is used to denote ambi-equality of M and sumC, 


ance occurred equally between Adminis- 
trations II and III and between Adminis- 
trators А and В. In both these categories, 
exactly half the cases maintained a con- 
stant balance and half shifted in one 
direction or another. 

In reading Table I, it must be remem- 
bered that a shift, from M to C for ex- 
ample, means that the subject actually 
showed more M than C under the first 
condition considered and more C than M 
under the second condition listed. 

A first glance at the totals in the 
columns labeled "Per Cent Stable" and 
"Per Cent Unstable" might suggest that 
approximately half the subjects main- 
tained a constant stability throughout 


of the ratio, four showed consistent 
weighting on the sumC side, and none 
maintained ambi-equality of weighting: 
One may, therefore, conclude that 39 per 
cent of the subjects retained a stable Ex- 
perience "Type throughout the experi 
ment, approximately three fourths of 
whom were introversively oriented, а 
quarter of whom were extratensively 
oriented. 

These conclusions, then, would sup- 
port the prediction stated earlier in this 
study that approximately a third of the 
subjects would produce stable records. 
However, on the basis of Rorschach’s 
description of the M-type and C-type per- 
sonalities, noted earlier in this discussion, 


D 


> 


> 
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one would have suggestedghat the four- 
teen subjects with stable records would 
all have been of the M-type. Table I 
shows us that A-type, M-type, and C-type 
personalities in one situation can become 
another personality type in another 
situation. The additional analysis of the 
data shows us that both M-type and C- 
type personalities are capable of main- 
taining their personality type in the face 
of varying situations, but that the odds 
are three to one that the subject who 
maintains such constancy will be an M- 
type personality. 


D. STATISTICAL TREATMENT OF THE DATA 


In order adequately to test the princi- 
pal hypothesis of this experiment con- 
cerning the influence of negative and 
positive rapport conditions оп Rorschach 
performance, and the numerous col- 
lateral hypotheses—the influence of suc- 
cessive administrations and different ad- 
ministrators, and the kind and amount of 
variation in the numerous scoring cate- 
gories—one must look to statistical analy- 
Sis. 

To justify the greatest confidence in 
probabilities obtained from analysis of 
the data, t-ratios were computed for the 
airs of series offered by the 


numerous p 
о used was 


data; The particular ітай | 
Fisher's formula as described by Lind- 
quist (26, р. 58 fI.) under the title “The 
Significance of a Difference in the Means 
of Related Measures": 


of the criticism to 
as a result of the 
categories as unique 


1 Cognizance is taken here 
Which this study is subject 
Practice of treating scoring 
Parts of a ‘hele without reference to the whole 
of which they are an integral part. The яе 
сап think of no other method of cipem 
approach to the investigation of the тогава 
and the personality factors it is judged to ШЕ 
ure except by analysis of the separate жеше E 
Which contribute to the whole. Recognizing ie 
imperfection of the approach, the plea is 0* aa 
that it remains the best of all possible exisi ing 


Approaches. 


" M, — Mg 
Sum d? 
n(n — 1) 


These t-ratios (a total of 243) and the in- 
terpretations of them are presented in 
the following pages. 

In working the /-tests, data were frac- 
tionated in the following manner: АП 
tests given during the first session were 
grouped, regardless of who gave them 
or of what kind of affective tone marked 
the administration; this procedure was 
followed for the second and third ad- 
ministrations as well. All tests given by 
Administrator À were grouped, regard- 
less of whether they were administered 
first, second, or third in the series, re- 
gardless of whether the tone of ad- 
ministration was positive, negative, or 
neutral. A similar grouping was made for 
tests given by Examiners B and C. All 
tests given during а positively loaded 
test-situation were grouped without re- 
spect to who administered them or to the 
serial number of the situation. Identical 
groupings were made for tests admin- 
istered in the negative and the neutral 
situations. 

This combination of heterogeneous 
data, of course, introduces a special prob- 
lem in the interpretation of the results 
of the t-tests. If variance is found due 
to administrations, this variance is op- 
erating when testing the differences be- 
tween administrators and between types 
of administrations. Where significant 
variance exists only within one of the 
three major groups of variables, this 
problem loses some of its importance. If, 
however, variance is shown to exist as a 
function of the repetition of the test, this 
variance will operate to enlarge the 
sigma, hence enlarge the denominator, 
of the t-tests between other variables. 
The influence would produce spuriously 
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low tratios among the other groups of 
variables. _ 

This weakness of the t-test, which in- 
evitably exists when data are fractionated 
and combined as in the present study, 
is not a serious drawback if one bears in 
mind that t-ratios approximating zero 
(say, under 1.000) throughout one whole 
set of data strongly indicate no real dif- 
ferences; if one remembers, also, that any 
significant or very significant t-ratio in 
one of the three groups of variables 
within a set will contribute to spuriously 
low t-ratios among the other groups, and 
that, therefore, a marginal t-ratio would 
be significant if the variance due to other 
factors were partialed out. 

One other element of the statistical 
treatment demands discussion: Compu- 
tation of t-ratios has been made on mean 
frequencies of scoring categories regard- 
less of whether these scores are expressed 
as percentages of the total responses or as 
absolute numbers. While the percentages 
are not seriously affected by the variation 
in total number of responses, the abso- 
lute numbers are. For example, if the 
positive administration produces more 
responses than the negative, there will 
not necessarily be any alteration in the 
per cent of whole responses; there will, 
however, necessarily be an increase on 
the positive administration in the fre- 
quency of certain of the response deter- 
minants. On a statistical level, one is 
eager to discover which determinants in- 
crease; on an interpretative level, one at- 
tempts to explain why one determinant 
increases and another does not, whether 
the increase is a concomitant of increased 
responses or not. 

If there were a significant difference 
in number of responses from the first to 
the second to the third administration, 
any differences in tabulated Scoring sym- 
bols would have to be held suspect as an 
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artifact or the frequencies would have 
to be reduced to ratios. However, no such 
differences in R. were found to exist. 


E. STATISTICAL FINDINGS 

Number of responses. The greatest con- 
stancy in mean number of responses 
proved to be that between administra- 
tions. It is interesting to note here that 
Rorschach literature is consistent in re- 
porting that the normal number of re- 
sponses is around thirty with normal 


ranges reported from fifteen to seventy- 
five. 


Hermann Rorschach: Normal subjects gencr- 
ally give from 15 to go responses, rarely less than 
15, often more than зо (29, р. 21). 

Klopfer and Kelley: The range of responses 
found most frequently in all large-scale investiga- 
tions of adults seems to be between twenty and 
forty (21, pp. 208-209). 

Brussel and Hitch: Normal is around 34... · 
Normal range is from 25 to 75 (6, p. 3). 

Bochner and Halpern: The average (number 
of responses) seems to fall between 20 and 59 
(5, p. 71). 

Charlotte Buhler gives a minus weighting of 
3 for fewer than twenty-five responses (7, p. 11): 


As an aside, one wonders if these pub- 
lished means and ranges are not a bit 
high. Only on the positive administra- 
tion and under C's administration do our 
subjects reach or exceed means of thirty 
responses. 

Further in this vein, it should be noted 
that the previously cited study conducted 
by the Army Air Forces (32) shows the 
average number of responses for service- 
men undergoing classification testing to 
be only twenty, Perhaps this particular 
Army test situation stimulated an inhibi- 
tion of responses, Perhaps, too, the 5р6" 
cial population of the present study €X- 
perienced a special kind of inhibition of 
response. 

Perhaps, on the other hand, the figure 
thirty is not the true mean for the popu- 
lation as a Whole, is not the appropriate 
cut-off point for a normal number of 


TW % 
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responses. Possibly separatg norm-groups 
should be established for various seg- 
ments of the population. 

Although there is an increase in the 
mean number of responses on both the 
negatively and positively loaded admin- 
istrations over the neutral administra- 
tion, with the positive eliciting more 
than the negative, the variability within 
each type of adminisration is large 
enough to keep the differences below the 
critical cut-off point for statistical signifi- 
cance higher than the 10 per cent level of 
confidence which exists between the posi- 
tive and the neutral administrations. 

Thre is a very significant difference, 
however, between the number of re- 
sponses elicited by Administrators А and 
C. Whether giving a first, second or third 
administration, whether being negative, 
positive, or neutral, Administrator C ob- 
tained, on the average, а sufficiently 
larger number of responses virtually to 
rule out chance as a cause of the dif- 
ference (1 per cent level of confidence). Р 

Average time рет response. Very sig- 


nificant differences were found in the 


average response time between the first 
test and both the second and the third 
tests, These differences can be explained 
rather easily in terms of familiarity with 
the task. On being first confronted with 
the Rorschach plates, the subjects unt 
formly spent more time formulating their 
concepts than they spent on subsequent 
reassociation with the stimuli. The reduc- 
tion in response-time is uniform in direc- 
tion from beginning to end of the series, 
despite the lack of a significant difference 
between the second and third administra- 
tions. 

There is a consistent tendency toward 
a difference in response times between 
Administrator С and the other two ad- 
ministrators; in fact, the difference be- 
tween A and C is significant at the 5 per 


cent level when rounded to two decimal 
places. Apparently the subjects, on the 
average, lingered longer over their re- 
sponses when giving them to Administra- 
tor C. 

Crude control. F% is considered to be 
one of the key determinants in a Ror- 
schach protocol. It is the scoring symbol 
used to indicate the percentage of re- 
sponses determined solely by the shape, 
contour, or outline of the blot. Briefly, 
it is interpreted as a measure of the con- 
trol exercised by the subject. Buhler and 
Lefever (8) designate it as a measure of 
rational functioning. Excessive F% (50 
per cent or above) is considered to be an 
indication of undue constriction, or an 
accompaniment of depression or anxiety. 

While the means of the various meas- 
vres of our subjects stay well under the 
50 per cent upper limit for normalcy, we 
may speculate that significant increases in 
F% mean increased depression, anxiety, 
or constriction. No such significant in- 
creases are found as a function of the 
number or the type of the administra- 
tion; however, it is interesting to observe 
that the lowest mean is found for the 
positive administration and the highest 
for the negative, 

Again we find significant differences 
among administrators, A differing at the 
1 per cent level with C and at the 5 per 
cent level with B. There is no difference 
between B and C. 

Refined control and its components. 
The FK + F + Fc per cent is another 
measure of control; however, it is dis- 
tinguished from F% alone by the quality 
of the control it reflects. Klopfer (21) 
has called the latter “crude control" and 
the former "refined control." Separately 
considered, the determinant FK (vista 
concept) is usually interpreted as reflect- 
ing introspection or insight; Fc (texture 
concept), as reflecting tact or sensitivity. 
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Although there is not a critically sig- 
nificant difference in the amount of re- 
fined control displayed in the various 
types of administration, the means show 
a consistent increase from the lowest 
mean on the positive administration to 
the highest mean on the negative ad- 
ministration, paralleling the tendency on 
F% or crude control. 

A significant difference between the 
first and the third administration sug- 
gests several alternative possibilities, The 
mere repetition of the Rorschach prob- 
ably brought about a degree of loss of 
spontaneity in responding to the stimuli. 
This was replaced successively more fre- 
quently by increased refined control of 
reactions to the stimuli. 

While there is practically no difference 
between Administrators A and B in tbe 
average percentage of responses with this 
combination of determinants, Adminis- 
trator C differs at the 10 per cent confi- 
dence level from A and differs signifi- 
cantly from B. 

Further light may be thrown on the 
factors influencing these differences in 
the combination-score by inspecting the 
components that contribute to it, F% has 
already been analyzed. The raw data 
show a low frequency of FK and Fe re- 
sponses. The responses, however, occur 
with adequate frequency and with suffi- 
cient approach to normality of distribu- 
tion to permit application of the t-ratio 
statistic. 

A t-ratio for Fc between Administrators 
B and C was not computed since the dif- 
ference between means of :05 is so small 
as to insure a t-ratio of approximately 
zero. Despite the low frequency and small 
differences in means between A and the 
other two administrators, the variability 
was sufficiently small to produce a signifi- 
cant difference between A and C, and a 
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very significant difference between A and 
B. The latter two administrators con- 
sistently elicited more Fc responses than 
did A. This observation would tend to 
rule out Fc as a contributing factor to 
the significant difference between B and 
C on the FK + F + Fc percentage. Con- 
sidering, too, the insignificant t-ratio be- 
tween B and C on F%, one might sup- 
pose that despite very low frequency and 
apparent scatter, the FK responses are 
the greatest single contributing factor to 
the ЕК + F + Ес per cent difference be- 
tween B and C; however, this convenient 
supposition is not supported by the data, 
as will be subsequently shown. 

It is interesting to note that F% (crude 
control) was sufficiently higher for A 
than for B or C to produce significant 
and very significant differences respec- 
tively; whereas, Fc (tact or sensitivity) 
was sufficiently lower for A than for B 
and C to produce very significant and 
significant differences respectively, It is 
not surprising, then, that when these two 
determinants, F and Fc, are summed with 
à third determinant FK the differences 
between A and both B and C disappear 
but the difference between B and C is 
exposed. 

The scoring category FK is used for 
Concepts incorporating vista, distance, 
perspective. There are no significant dif- 
lerences in FK on the successive admin- 
istrations, although the means reflect à 
consistent decrease in the use of this de- 
terminant, Nevertheless, no inferences 
can be made from such data. 

The lack of Significant differences 
among administrators also permits no in- 
ae ent or conclusion. w 
fered mb rdi ont Analysis of FÉ nai 
growing out oy a ol ше ачен 

g of the findings for FK + F 


+ Fe per cent. This latter category, then, 
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appears to be a unique whple, not paral- 
lcling any one of its three components in 
its stability or variation under the varied 
situations. 

The affective loading of the adminis- 
trations had a significant degree of in- 
fluence on the scoring category FK. Both 
the negative and the positive administra- 
tions show more mean FK than docs the 
control administration. The difference 
between the neutral and the positive ad- 
ministrations is significant at the 5 per 
cent level of confidence. 

The symbols К and k. Responses classi- 
fied as K are interpreted as a sign of 
“free-floating” anxiety or of insecurity. 
Douglas (10) found the use of this type of 
response serving "as conversational ma- 
terial until something further can be 
found." 

K responses remain quite stable 
through the successive administrations. 
There is a noticeable difference, however, 
in the amount of K elicited by the dif- 
ferent administrators, B and C differing 
at the 10 per cent level of confidence. 

Like K, responses classified as k are 
interpreted as reflecting feclings of anx- 
iety, insecurity, or inadequacy; however, 
the two categories are differentiated by 
the control factors implicit in a k re- 
sponse, absent in a K response. . 

The low frequency of the determinant 
k is reflected in low means for the various 


measures. Neither the variation of ad- 
the type of administra- 


ministrator nor 
tion affected this scoring symbol to à 
degree worth noticing. There is a con- 
sistent, though insignificant, reduction in 
the appearance of this determinant on 
the successive tests, each administration 
cliciting somewhat fewer such responses 
than the preceding. . : 

The C' determinant. Our thirty-six 
Subjects produced, on the average, about 


one and a half achromatic color responses 
(C^) per record without variation from 
one administration to the next. There 
was variation in the C' responses with 
type of administration. The increase 
from an average of one C' on the con- 
trol Rorschachs to an average of two on 
the positively loaded Rorschachs boosts 
the confidence level between these vari- 
ables to 10 per cent, Administrator А 
elicited the lowest frequency of C/ re- 
sponses, differing from both B and C at 
the 5 per cent confidence level. 

Movement responses. There are no 
statistically significant differences higher 
than the 10 per cent level of confidence 
in the number of Fm, inanimate move- 
ment, responses produced under any of 
the three variations of the testing situa- 
tion. The constancy in number from one 
administration to another is so great as 
to preclude the necessity of computing 
t-ratios. 

Mean differences, however, exist as a 
function of the variation in type of ad- 
ministration, the t-ratio between the con- 
trol and the positive administrations ex- 
ceeding the 10 per cent level of confi- 
dence that a real, not chance, difference 
exists. 

The animal movement responses, FM, 
vary little or not at all on the successive 
administrations, and they fail to vary 
significantly with variation in type of 
administration. However, there are differ- 
ences at the 5 per cent level of confidence 
between Administrators A and C, and at 
the 1 per cent level between B and C, 
Administrator C eliciting the larger num- 
ber of responses in both comparisons. 

The frequency of responses classified as 
M, human movement, varied sufficiently 
to reach the 5 per cent level of confidence 
between one pair of variables and the 10 
per cent level between four other pairs. 
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Human movement is consistently inter- 
preted as a sign of inner adjustment or 
equilibrium, a capacity for the absorp- 
tion of emotional stimuli whether origi- 
nating from within or without. 

An excess of M on the positive admin- 
istration over the neutral created a differ- 
ence significant at the 5 per cent level of 
‘confidence, and the positive administra- 
tion shows a difference from the nega- 
tive at the 10 per cent level. 

Administrator C provoked the highest 
mean frequency of M responses, differing 
from both A and B at the 10 per cent 
level of confidence. Further, the mean 
number of M on the third administration 
was sufficiently below the number on the 
second to give a Latio significant at the 
10 per cent level. 

Emotionality. 'The C-type personalizy 
was discussed earlier in this paper, and 
the relationship of sumC to M was de- 
fined. Not forgetting the importance of 
this inter-relationship, let us look at the 
meaning of color responses per se in Ror- 
schach records. Color is considered to be 
the determinant of emotionality. Lack of 
color is considered an indication of con- 
striction; excessive use of color is inter- 
preted as emotional lability or over- 
reaction to external emotional stimuli. 

The use of color varied little on suc- 
cessive administrations. There was an in- 
finitesimal difference between the sub- 
jects’ sum of color responses on the neu- 
tral and the negative administrations; in 
fact, no significant differences occurred 
among any of the types of administra- 
tion. 

The really important differences in the 
use of color responses occurred with the 
variation of examiners, Administrator A 
elicited significantly fewer color responses 
than C and very significantly fewer than 
B. 


Content categories. Normally, the sum 
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of animal and gnimal-detail concepts con- 
stitutes approximately 50 per cent of the 
content of Rorschach concepts. Our sub- 
jects linger close to this norm. An in- 
crease in A% is usually interpreted as 
evidence of stereotypy of thinking or a 
repression of intellectual activity; a de- 
crease in A%, contrarily, is widely inter- 
preted as less confinement to the obvious, 
а broader range of thinking and/or inter- 
ests. 

There are completely unimportant dif- 
ferences in A% on successive administra- 
tions of the test. There is no appreciable 
difference in A% from one examiner to 
another; although, again, the differences 
between Administrator A and the other 
two examiners show a stronger tendency 
toward significance than does the differ- 
ence between B and C, that between A 
and С closely approaching the 10 рег 
cent level of confidence. 

The percentage of A% responses on 
the affectively loaded administrations dif- 
fers significantly, with the negative ad- 
ministration producing the percentage 
higher than that on the positive adminis- 
tration. Also, the neutrally conducted 
tests deviate from the positive with a dif- 
ference at the 10 per cent level of con- 
fidence. 

There is agreement among users of 
the Rorschach that the number of animal 
responses should and do exceed the num- 
ber of animal.detail responses in a nor- 
mal, healthy record. Therefore, it was 
considered advisable to break down the 
A% scoring symibol into its component 
parts in an effort to examine the circum- 
stances under which both animal re- 
sponses and animal-detail responses may 
vary or remain stable. 

Examination reveals no significant dif- 
ferences in the number of whole animal 
responses elicited in the various situa- 
tions. Even the means do not show a dif- 
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ference worth comment.,Certainly this 
analysis of the number of whole animal 
responses does not help appreciably to 
clarify the differences obtained in the 
А9. A look at the means and critical 
ratios for animal details is indicated; 
however, one must not overlook the fact 
that absolute number frequencies intro- 
duce different considerations from those 
under consideration when one is speaking 
of percentages. 

In all measures, the number of А re- 
sponses exceeds the number of Ad re- 
sponses, indicating that our subjects on 
the average maintain the healthier bal- 
ance regardless of variation in testing 
situations. With both symbols there are 
infinitesimal differences in frequencies 
from one testing situation to the next. 
Among administrators, C elicits the high- 
est frequency for both А and Ad re- 
sponses. Among the types of administra- 
tions, the control Rorschachs show the 
lowest mean for both scoring categories. 

Differences at the 5 per cent level of 
confidence exist between Administrator A 
and both B and C, with A gleaning sig- 
nificantly fewer Ad responses than B or 
C. 

As with other scoring categories, the 
number of A and Ad responses by them- 
selves are considered to have little mean- 
ing. Whereas, an increase in Ad may in- 
dicate a tendency toward a critical atti- 
tude, this interpretation of excessive Ad 
in a record is made only if the sum of 
animal.detail and human-detail responses 
exceeds the sum of whole animal and 
whole human responses. This observation 
calls for an analysis of the number of 
human and human-detail responses. 

Several measurable differences in num- 
ber of whole human responses exist. The 
frequencies are exceedingly stable from 
the first through the third administration. 
While Administrator C provoked a larger 


mean number of human responses than 
did either А or B, only the difference 
with A is great enough to produce a t- 
ratio significant at the 10 per cent level 
of confidence. 

The means of the types of administra- 
tion show an increase in human concepts 
on both the negative and positive admin- 
istrations over the number on the control 
administration, with the difference be- 
tween the neutral and positive rising to 
the 10 per cent level of confidence. 

Again, the frequency of whole versus 
part concepts of human figures is a more 
important consideration than the mere 
number of either. 

A comparison of means reveals that the 
records of our subjects, regardless of 
variables, maintain a healthy preponder- 
ance of H over Hd responses. 

Administrator C evoked a greater num- 
ber of Hd responses than either B or A, 
and the mean of C's administrations dif- 
fers significantly from that of A. 

Popular responses. 'The means of the 
popular responses elicited by the various 
examiners are identical. There is a slight 
but steady increase in mean P from the 
first to the second to the third adminis- 
tration, which may easily be explained in 
terms of increased familiarity with the 
material through mere repetition. 

The difference between the means of 
the negative and positive administrations 
reaches the 10 per cent level of confi- 
dence, the permissive situation stimulat- 
ing the higher number. 

Use of ground as figure. Neither repe- 
tition of administration nor affective 
loading of administration produced any 
difference in the number of white-space, 
S, responses on the Rorschach records. 
Neither are Administrators A and B es- 
sentially different in the amount of S 
they provoked on the protocols. Adminis- 
trator C, however, differed from both A 
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and B in eliciting a higher frequency of 
white-space responses, and differed from 
А at the 5 per cent level of confidence. 

An earlier discussion of additional re- 
sponses explained why they were not in- 
corporated into the analysis of results on 
this study. An exception should be made, 
however, of the white-space scoring cate- 
gory. Among normal or near-normal sub- 
jects, white-space responses occur most 
frequently concomitant with reaction to 
inked areas of the plates. Arbitrary scor- 
ing methodology requires that, in such 
mixed concepts, the inked area be scored 
as the main response location and the 
white space be scored as an additional re- 
sponse location. Consequently, many 
clear-cut reactions to white space are 
buried among the additional responses. 
Better to understand the actual amourit 
of and variability in the use of white 
space, it was decided to sum the absolute 
number of main and additional S re- 
sponses and analyze the distribution of 
this total reaction to ground. 

The total use of white spaces by our 
subjects varies significantly with type of 
administration and among administra- 
tors. The smallest amount of total S + 
(s) was elicited on the neutral or control 
administrations, the most on the positive 
administrations. While the actual mean 
difference between neutral and negative 
administrations is small, the direction of 
change is so consistent, the variability 
within the columns so small, that the dif- 
ference reaches the 5 per cent level of 
confidence. The higher mean difference 
between neutral and positive administra- 
tions is accompanied by sufficiently in- 
creased variability to hold the difference 
down to a 10 per cent level of confidence. 

Among administrators, there exist real 
differences in the amount of sum-S pro- 
duced, regardless of number or type of 
administration. A evoked the least, and 
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C the most, us? of white space. The dif- 
ference is significant at the 5 per cent 
level of confidence. C also exceeded B 
in elicitation of total S responses with a 
difference significant at the 10 per cent 
level. The most significant difference, 
however, is between administrators А and 
B, with a t-ratio at the 1 per cent level of 
confidence, 

Percentage of responses to all-color 
cards. Color, as previously stated, is in- 
terpreted as a correlate of emotionality. 
Since cards VIII, IX, and X on the Ror- 
schach test are composed exclusively of 
bright-color inks, they are construed to 
have particular significance in reflecting 
emotional components of a subject's per- 
sonality. The normal expectancy of re- 
sponses on these all-color cards is approxi- 
mately 40 per cent of the total responses. 

Our subjects consistently approximate 
normal expectancy, with little variation 
from one administrative situation to an- 
other. The largest difference occurs be- 
tween the neutral and negative adminis- 
trations, where the (-ratio rises to a 10 per 
cent level of confidence. 

: Apparently the over-rejecting situa- 
tions brought about a measurable de- 
crease in response to external emotional 
stimulation, creating a dampening effect 
on the subjects. 

Manner of approach. The number of 
whole responses in a Rorschach record 
provides two separate clues for interpre 
tative analysis: (1) a clue as to the рег 
centage of responses which are based ОП 
corporation of the total blot-stimulus 
in the concept-form 


: ation; (2) а clue as tO 
the ratio betw 


een whole responses and 
movement responses, i.e., the W : M ratio. 
The first of these 


clues will be explored 
later mee 


: à Š : 
!pon the presentation of means and 


t-rati i 
atios for W%. The second requires the 
reintroduction of the 


w means for the 
Scoring symbol M. 
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There is a steady аесгегде in the mean 
number of whole responses with each 
repetition of the test, the difference be- 
tween number of W responses on ad- 
ministrations I and III being significant 
at the 5 per cent level of confidence. 

If the W : M ratio much exceeds 2 : 1, 
the interpretation is usually made that 
the subject is over-extending himself; 
ie, he is striving beyond his means 
of achievement. It is not considered a 
healthy sign. Let us compare our W:M 
mean ratios for any evidence of this im- 
balance. 

One ratio that was obtained on the 
second administration achieves the ideal 
relationship of 2 : 1. With one exception, 
all other ratios show heavier W weight- 
ing; however, the ratio of $ : 1 is not suf- 
ficiently in excess of the ideal to merit 
interpretation of a significant imbalance. 
The ratio on the neutral administration 
of 4 : 1, however, is twice the expectancy 
ratio. On the basis of Rorschach litera- 
ture, one would have predicted a 2: 1 
ratio on this control administration, re- 
gardless of what other predictions at- 
tended the other administrations. 

'The really crucial interpretative fac- 
tor related to whole responses is the scor- 
ing symbol W%—the percentage of total 
responses which use all of the inked sur- 
face of the cards within the framework 
of a single concept. “Tn a normal record," 
say Brussel and Hitch, “about 25 per cent 
of whole answers are expected" (6). The 
Individual Record Blank for scoring Ror- 
schach protocols, developed by Klopfer 
and Davidson, and in wide use, shows a 
normal range of W% extending from 20 
to 4o. Every mean in the present study 
exceeds not only the figure given as a 
normal mean but also the figure given 
as the upper limit in the range of normal 
expectancy for Wo. One might assume 
that the stated norms are in error; how- 


ever, it seems less arrogant to consider 
the possibility that the subjects in the 
present study deviate from the popula- 
tion as a whole somewhat. 

Examination supports the earlier find- 
ing of reduction of whole responses as a 
mere function of repetition of the test. 
When working with percentages rather 
than absolute numbers, this reduction in 
tendency is more dramatically revealed. 
While the decrease in W% is insignifi- 
cant from the first to the second adminis- 
trations, the decrease is significant at the 
1 per cent level between the third and 
both the first and the second adminis- 
trations. C's protocols show the lowest 
mean percentage of whole responses and 
differ at the 5 per cent level of confidence 
with A's records, which show the highest 
nfean W% among administrators. 

While none of the differences in means 
of varied types of administration are sta- 
tistically significant, it is interesting to 
observe that the negative administration 
shows the highest percentage of whole 
responses and the positive situation the 
lowest percentage. 

As W% decreases, it is mathematically 
certain that other location sub-divisions 
will increase. The nature of these in- 
creases can be explored only by analyz- 
ing the frequencies in the data of large 
details, small details, and tiny or unusual 
details as they are employed by the sub- 
jects in concept-formations. 

One clear-cut difference exists between 
Administrators A and С. Administrator A 
elicited the highest percentage of W% 
responses and the lowest percentage of 
D% responses; in both scoring categories, 
A and C differ at the 5 per cent level of 
confidence. 

No other variation in testing situation 
creates differences of statistical signifi- 
cance. Nevertheless, consistent tendencies 
are revealed by the means of the various 
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types of administrations. The negative 
administration provoked the lowest mean 
D% and the positive administration the 
highest. The t-ratio between them barely 
misses reaching the 10 per cent level of 
confidence. 

The location scoring symbol D is used 
when subjects form a concept using a 
large, insular part of the blot, frequently 
perceived as a separate entity by normal 
subjects. Between 45 per cent and 55 
per cent of total responses are expected to 
be D responses. 

Not until the third administration did 
the subjects show any measurable in- 
crease in the use of small usual details 
in the blots. The production of concepts 
based on these small areas remained con- 
stant from the first to the second test but 
differed between the second and the third 
to a degree significant at the 10 per cent 
level of confidence. 

The average intelligent subject is ex- 
pected to have from 5 per cent to 15 per 
cent of his responses in the d% category. 
The third administration, then, most 
nearly provoked d% responses up to a 
minimum of expectancy for our intelli- 
gent subjects. Decreased d% is not gen- 
erally interpreted as having any particu- 
lar significance, and increased d% is 
ignored unless it exceeds 15 per cent of 
the total responses. 

Although there is a lack of significant 
differences between the various adminis- 
trators, Administrator C has the highest 
mean 4% within the administrator 
group, just missing 10 per cent t-values 
with both A and B. 

Dd + 5% is a sort of waste-basket cate- 
gory for all responses based on blot areas 
not defined as W, D, or d. Such responses 
may include tiny details, inside or edge 
details, unusual combinations of large or 
small blot areas, and the use of ground 
as figure. White-space responses were 


analyzed earlier by absolute number be- 
cause of their special qualitative meaning 
in Rorschach interpretation. Their tabu- 
lation in combination with unusual de- 
tails is a necessary part of the process of 
recording percentages of responses by lo- 
cation areas alone. 

There is complete agreement among 
Rorschach authorities that absence of 
Dd + S% is normal. There is, however, 
some difference of opinion as to how high 
this percentage may go before the upper 
limit of normalcy is exceeded. The range 
of estimates is from 2 per cent to 10 per 
cent, with the consensus approaching the 
larger figure. In any event, the records 
of normal persons rarely exceed 10 per 
cent in this category, On all variations 
of the testing situation, the subjects in 
this study remain under this 10 per cent 
maximum; however, the ceiling is ар- 
proached in the means of third adminis- 
tration and of the positively administered 
tests. None of the differences between 
means are statistically significant. 

While Dd + S per cent in excess of 1° 
per cent is interpreted as an abnormal 
sign, even an excess of 5 per cent in the 
category is, by some authorities, consid- 
ered to be evidence of a pedantic ар” 
proach to the test (and, by extension, to 
the environment and to external stimuli, 
in general. All nine of our means €% 
ceed 5 per cent. The foregoing interpre 
tation is more readily made if the higher 
percentage of Dd + S responses occurs in 
conjunction with high W%, a fact which 
we have already observed to exist with 
our group. 

Summary of t-ratios. This completes 
the statistical analysis of the data on the 
numerous Rorschach scoring categories: 
ed zur lengthy analysis of means 

. Lratios by Rorschach scoring catë 
Sores seemed the most clear-cut way © 
presenting the data of this study. How 
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ever, that procedure inevitably produces which each variable, within each of the 
some confusion as to how many statisti- administrative groups, achieved the high- 
cally significant t-ratios were found and est, middle, and lowest means. It will be 
where, how frequently Administrator A noted that the positive administration 
or administration + had the highest and Administrator C reveal almost identi- 
means and on which types of response. cal frequency patterns. No other pairs 
These data are presented in summary show this marked parallel tendency. 
form in Tables II to IV. As Table IV reveals, there were, at the 

Ñ TABLE II 
RANK ORDER or MEANS, ACCORDING TO TYPE ОЕ RESPONSE 
| 
Administration Administrator Туре of Administration 
Type of Response 
I II HI A B с + = o 
R 2 I 3 3 2 I I 2 3 
T т 2 3 3 2 1 Т 3 2 
Е% 3 2 1 I 2 3 3 т 2 
FK І 2 3 2 3 I z = 3 
K I 2 3 s 3 I I 2 š 
Fk š 2 3 2 3 I I 3 2 
#F 3 ї 2 eat 3 2 2 1 3 
ЕС I 3 2 3 2 1 I 2 š 
à CF 1 2 3 3 1.5 1.5 1 3 2 
(d © 1 5 2 1 2 3 т 3 2 
FK+F+Fc/R 3 2 ї 2 I 3 3 І 2 
м 2 І 3 2 3 1 1 2 3 
sumC I 3 2 3 I 2 1 3 2 
FM 2 I 3 2 3 1 I 3 2 
Fm 2.5 235 I 3 2 I I 2 3 
Ес š 2 т 3 2 I I 3 2 
(әл І 2 3 3 2 H І 2 3 
A 2 3 1 1 2 3 3 I 2 
p 3 2 1 2.5 2.5 I 1 3 2 
Ë 2 I 3 4 < 5 I 1 2 3 
A 2 I 3 2 3 1 I 2 3 
È Hd 3 т 2 3 d s š = 8 
Ad 3 2 I 1 2 x 3 m 3 
CR% 2 3 I 3 I 2 s $ > 
W% I 2 3 z = 3 
I 2 3 3 2 1 I 2 3 
bo, 3 2 1 3 2 1 T 3 2 
d% 2 3 I 3 2 1 2 3 т 
° 2 1 3 2 1 I 2 3 
Dd+S% £ 4 2 3 2 I £ 2 3 
5 2 3 2 т т 2 3 
S+(S) n Š 


Table II serves to show the administra- 
tive circumstances under which the means 
of each of the thirty-one Rorschach re- 
sponse categories, herein analyzed, held 
the highest, middle, or lowest numerical 
position within its group. 

Table III shows the 


ym 


frequency with 


1 per cent level of confidence, a total of 
ten differences, over four times as many 
significant t-ratios as one would expect 
by random sampling; four of these were a 
function of repetition of the test, six a 
function of examiner differences. Admin- 
istrator A differed very significantly from 
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B on three of the scoring categories and 
from C on two of the measures. B and C 
differed only once at this level of signifi- 
cance. 

At the 5 per cent level of confidence, 
‘one would expect twelve significant dif- 
ferences by random samplings; whereas, 


TABLE III 


FREQUENCY OF MEAN POSITION, ACCORDING 
TO ADMINISTRATIVE VARIABLE 


Highest 


Lowest 


Adminis- Middle 
tration Mean Mean Mean 

I 13 

II 7 

III II 

A E 3: 
В 3i 19 

С 22} 3i 

+ 23 + 

= 6 14 

o 2 13 


® 
we obtained twenty-one additional differ- 


ences at this level alone. Combined with 
the ten t-values at the 1 per cent level, 
our total is thirty-one é-ratios equal to or 
above the value required for confidence 
at the 5 per cent level. This figure is al- 
most three times chance expectancy. Suc- 
cessive administrations accounted for two 
of the significant differences, and type of 
administration for four. The fifteen dif- 
ferences among examiners point up the 
greater frequency between A and C than 
between cither A and B or C and B. 
Three of the administrator differences 
were between А and B, one between B 
and C, but eleven existed between A 
and C. Combining differences at the i 
per cent and 5 per cent levels, A differs 
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six times with B and thirteen times 


. with C, while B and C differ a total of 


twice. 

Differences (5 per cent level) resulting 
from variation in the emotional tone ol 
the testing situation occurred twice be- 
tween the neutral and negative adminis- 
trations, once between the positive and 
neutral and once between the positivc 
and negative administrations. 

Clearly, the largest number of signifi- 
cant and very significant dillerences at 
the 1 per cent and 5 per cent levels were 
the result of examiner differences; and, 
within our group of examiners, A was the 
greatest contributing factor to frequency 
of differences. The negative and neutral 
administrations appear with equal fre- 
quency (three times each) as contributing 
factors to significant differences in type 
of administrations. The positive adminis- 
tration is a factor in two of the four dil- 
ferences. 

Table IV summarizes the seventeen l- 
ratios at the 10 per cent level of confi- 
dence. This figure brings to forty-eight 
the total number of ¢-values reaching or 
exceeding the 10 per cent confidence 
level, a total exactly twice as large as the 
twenty-four one would expect by random 
sampling. I 

At this level of significance, 


10 per cent, 
the 


Ne variations with type of administra- 
Son predominated, with nine differences 
tending toward significance, Two differ- 
ences were a function of repetition of the 
test, and six occurred with change of ad- 


ministrators, Administrator C was a party 


TABLE IV 


NUMBER oF MEANINGFUL 
Confidence i 
Level 1 II H HI IIIT 
196 £ ° 3 
5% o 1 I 
10% о 2 о 


RATIOS py VARIABLES 


Cumulative 


“еш +o = Total 
š о о 10 
1 2 31 
2 6 I 38 


\ 


EXPERIMENT 
to all of these variations differing three 
times with A and three times with B. Of 
the nine differences occurring with varia- 
tion in the tone of the administration, 
the positively loaded situation differed 
six times with the neutral administration 
and twice with the negatively loaded 
situation. The negative and positive ad- 
ministrations showed differences at this 
level of confidence only twice, the nega- 
tive and neutral only once. Summarizing, 
the positive administration was a factor 
eight times, the neutral seven times, and 
the negative three times. 

Summary of vesults by scoring cate- 
gories. A few of the Rorschach scoring 
categories proved exceedingly stable, 
showing no variation from test to retest, 
from examiner to examiner, from posi- 
tive to negative to neutral administra- 
tion. These stable categories arc the fol- 
lowing: Dd + S per cent, Fk, and num- 
ber of А responses. 

The majority of the Rorschach scoring 
categories showed measurable variability 
as the testing situation varied. The de- 
gree of variation, together with the ad- 
ministrative variable which produced the 
variation in each shifting category, was as 


follows: 


a. R: There was one difference at the 1 per 
cent level of confidence between admis 
and one difference at the 10 per cent level w jn 
variation in types of affective loading of the 
administrative situation. е 

b. T: There were two differences at the 1 
per cent level in average time per response on 

ations and one difference 


the successive administra e d ENS 
at the 5 per cent level between administrations. 
c. M: This scoring symbol proved to be pui 
i i differe he 5 per cer 
varial "ith one difference at the 5 
р t level as а 


r cen 
level and another at the 10 per cer ed s 
function of the varied affective loading | 
ences occurred at the 


iation in administra- 
he 10 per cent level 
f the test. 

inistrations nor 
situation in- 
however, 


testing situation; two differ 
10 per cent levcl with var 
tors, and one difference at th 
grew out of mere repetition о 

d. FM: Neither successive adm 
Variation in affect in the testing 
fluenced FM. Change of administrators. 
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produced one difference at the 1 per cent level, 
another at the 5 per cent level, 

є. Fm: This scoring symbol was resistant to 
change with repetition of the test or with change 
of examiners. It varied only with the variation 
in type of administration, and then only аг 
the 10 per cent level of confidence. . 

f. K: Administrator differences produced. one 
difference in K at the 10 per cent level of confi- 
dence. No other variables in the testing situations 
measurably influenced. this determinant. 

g. FK: This response determinant also re- 
mained stable through successive tests and de- 
spite variation in examiners, but it showed a 5 
per cent level of significant difference with varia- 
tion in affective loading of the testing situation. 

h. Fe: Examiner difference again is evident 
with this determinant, there being both a 1 per 
cent and a 5 per cent difference among admin- 
istrators, Neither retesting nor affective loading 
affected the mean number of Fe produced. 

i. C': This scoring symbol varied once at the 
5 per cent level with repetition of the test, 
twice at the 5 per cent level with change in 
administrators, and once at the 10 per cent 
Tével with variation in tone of administration. 

j. sumC: The composite score sumC remained 
stable from one test to the next and despite 
variation. in affective. loading of the testing 
situation; however, the mean scores varied sig- 
nificantly twice with change of administrators, 
once at the 1 per cent level, once at the 5 per 
cent level. 

k. F%: This factor also resisted change with 
number and type of administration; however, it, 
too, varied with the examiner once at the 1 per 
cent level, again at the 5 per cent level. 

1. A%: The difference in the type of admin- 
istration produced two differences in A%, one 
at the 5 per cent level, the other at the 10 per 
cent level. Neither administrators nor successive 
testing influenced the means significantly, 

m. P: The number of popular responses rc- 
mained somewhat constant throughout the ex- 
periment. One difference, at the 10 per cent 
level, occurred with variation in the affective 
loading of the testing situation. 

n. FK + F + Fc / R: The affective loading of 
the administration did not influence this compos- 
ite scoring factor; however, both successive 
testing and change in administrators brought 
about one difference each at the 5 per cent 
level of confidence. Examiner differences ac- 
counted for an additional difference at the 10 
per cent level. 

o. H: The number of human-figure responses 
varied at the 10 per cent levcl once with change 
in administrators and once with variation in 
the type of administration. 

Hd: Human-detail responses were un- 
affected by retesting or by affective loading of 
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the test situation; however, there was one differ- 
ence at the 5 per cent level with change in 
administrators. 

q. Ad: Animal-detail responses were also im- 
pervious to the emotional tone of the administra- 
tion and to successive testing, but they twice 
showed a difference at the 5 per cent level with 
variation in administrators. 

т. CR%: The percentage of responses to the 
all-color cards remained somewhat stable, show- 
ing only a single difference at the 10 per cent 
level of confidence upon variation of the affec- 
tive loading of the administration. 

s. #W: The number of whole responses 
varied once at the 5 per cent level with succes- 
sive testing but remained impervious to change 
with varied administrator or type of administra- 
tion. 

t. W%: W% remained unchanged with varia- 
tion in emotional loading of the test situation, 
but showed two differences at the 1 per cent 
level with retesting and one difference at the 
5 per cent level with variation in administrators. 

u. D%: This location scoring symbol varied 
once at the 5 per cent level with change in 
examiners but otherwise remained quite uñ: 
changed. 

v. 4%: The percentage of small usual details 
remained stable except for one difference at the 
10 per cent level with successive testing. 

w. S: The number of white-space responses 
did not differ from one examination to the next 
but varied between examiners once at the 5 per 
cent level and differed once at the 10 per cent 
level with alteration of affective loading of the 
administration. 

x. S + (5): The sum of main and additional 
white-space responses resisted change with suc- 
cessive administrations; however, the variation 
in affective loading produced one difference at 
the 5 per cent level. Change of administrators 
proved to be the most significant variable with 
one difference at the 1 per cent level, one at the 


5 per cent level, and a third at the 10 per cent 
level of confidence. 


F. Review or RESULTS RELATED TO 
ORIGINAL HYPOTHESES 


Let us briefly review the specific hy- 
potheses and predictions made at the out- 
set of this experiment and check our re- 
sults against them. 

i, The crucial hypothesis related to 
the stability of responses to the ink blots 
regardless of variation in environmental 
stimuli other than the ink blots; i.e., the 
person presenting the blots, the implicit 
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attitude of the &dministrator, or the mere 
repetition in presentation of the blots. 
Of the twenty-three Rorschach elements 
subjected to statistical analysis, only three 
were relatively unaffected by the extra- 
test stimuli, by "situational" stimuli. 
Nineteen of the twenty-three elements 
entered into differences at least once at 
the 1 per cent, 5 per cent, or 10 per cent 
levels of confidence. These nineteen vary- 
ing factors showed a total of forty-six dif- 
ferences attendant upon some variation 
in stimuli other than the test stimuli— 
the ink blots themselves. The hypothesis 
of stability does not hold. 

2. It was predicted at the outset that 
the number of responses would vary with 
administrators, regardless of number or 
affective tone of the administration. The 
means of responses for the three adminis- 
trators were, roughly, 24, 27, and 33, the 
difference between highest and lowest 
means being significant at the 1 per cent 
level of confidence. These findings sup- 
port the prediction. 

3. No consistent variation in the num- 
ber of responses was anticipated from the 
first to the second to the third administra- 
tion; and, in fact, there was none. Initial 
records of over seventy responses were €X- 
pected to decrease with successive admin- 
istrations, which they did. However, the 
prediction that initial records under 
thirty responses would show successive 
increase in number of responses was not 
Supported by the data, 

,4 Stability as a function of individual 
differences reflected in M : sumC ratio 
Proved, as predicted, to isolate approxi- 
mately 30 per cent of the cases. Also 
fulfilled was the prediction that the more 
Pea ааш records, those of the 
situational ud ei k P" 
Thy em. stability than the records e 
of the шош ‘Thirty-nine per сеп! 

Jects retained a stable Expert 


4 


pee 


DF 


EXPERIMENTALLY INDUCED VARIATIONS IN RORSCHACH PERFORMANCE 25 


ence Type throughout te experiment, 
approximately three-fourths of whom 
were M-type personalities, one-fourth C- 
type. 

5. The hypothesis that there would be 
some variation in the records obtained 
by the three administrators, regardless of 
number or type of administration, was 
more than supported by the data. In fact, 
examiner differences proved the most 
pervasive influence in the experiment— 
perhaps the most dramatic finding of the 
study. 

6. The predicted increase in frequency 
of k, K, c, and С^ with the negative ad- 
ministrations was in no particular real- 
ized. In fact, both Fk and Fc had the low- 
est mean frequency in the negative test- 
ing situation; the other symbols—K, FK, 
and C/—showed means on the negative 
administrations which were exceeded by 
the mean frequencies on the positive ad- 
ministrations. Apparently, as judged by 
these determinants, our thirty-six subjects 
found the positively loaded testing situa- 
tion more challenging or threatening 
than the negatively loaded. This finding 
negates the hypothesis under discussion. 

4. The hypothesized increase in white- 
space responses in the negative testing 
situation was somewhat supported, if one 
judges negative means against control 
means, Both main S and main-plus-addi- 
tional S responses showed a mean in- 


crease in the negative administrations 
over the control administrations. How- 
ever, the negative means of both meas- 
ures were exceeded by the means on the 
positive administrations. Apparently the 
negative administrations did provoke 
more oppositional behavior than the con- 
trol administrations; however, the posi- 
tive administrations elicited even more of 
this type of response than either the nega- 
tive or the neutral administrations, a find- 
ing which, in effect, negates the hypothe- 
sis of peak negatiyistic behavior on the 
negative administration. 

8. Both crude and refined control, as 
measured by F% and FK + F + Fc per 
cent respectively, were expected to in- 
crease in the negative administrations. 
These predictions were supported by the 
data. 

g. The anticipated increase in emo- 
tionality on the positive administration, 
as measured by sumC, was realized. 
There was a steady rise in mean sumC 
from the negative to the neutral to the 
positive administrations. 

10. The increase in movement re- 
sponses (M and FM) on the positive ad- 
ministrations occurred as predicted. Both 
scoring categories achieved their highest 
mean with the positive administration. 
The lowest mean frequency on FM oc- 
curred with the negative administration; 
on M, with the neutral administration, 


V. DISCUSSION 


N THE foregoing report of all of the 
I significant findings for the experi- 
mental variables of this study—adminis- 
trations, administrators, and types of ad- 
ministrations—there was little attempt to 
make qualitative evaluations or dynamic 
interpretations of the findings. Never- 
theless, one is justified in attempting—in 
fact, is obligated to attempt—to relate 
these data to the dynamics of personality. 


A. EXPERIMENTAL VARIABLES 


An effort will be made to summarize 
the effects produced upon the experienc- 
ing subjects by the different variables 
herein studied. Too, an attempt is made 
to understand the differences among the 
three examiners in terms of the dilflerefit 
effects they so consistently produced in 
the test results. While these procedures 
have highly speculative components, 
every effort is made to hold the specula- 
tions close to the actual data, to translate 
into dynamic terms the actual findings 
of this experiment. 

Influence of repetition of the test. The 
mere repetition of the Rorschach was the 
least important variable in the entire 
study, accounting for the smallest total 
number of important ¢-values, only eight 
in all. The categories showing change 
with successive administrations are, for 
the most part, such as can be quite easily 
accounted for. One is not surprised to 
find the average time per response con- 
sistently decreasing with repetition, this 
change accounting for two administra- 
tive t-values. On being successively con- 
fronted with the same stimuli, subjects 
needed no time to adjust to the stimuli: 
they could employ mere recall as a sub- 
stitute [or perception, apperception, and 
conceptualization. Furthermore, there is 
the strong possibility that the subjects, 
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on once learning the total demand or ex- 
pectancy offered by the test-challenge, 
simply functioned in line with our cul- 
tural speed-value, meeting the total re- 
quirements of the situation with as little 
expenditure of time as could be managed. 

We have elsewhere indicated that our 
subjects frequently showed moderately 
compulsive tendencies. As a defense 
mechanism, the compulsive tendencies 
would be expected to be most operative 
when the subjects felt most challenged 
or threatened. One influence of repeti- 
tion of the test on this dynamic pattern 
would be reduction of threat or challenge 
and consequent reduction in display of 
the compulsive defense, Three of our 
eight administration (-values grow out of 
reduced use of the whole blot in concept 
formation on successive tests. This phe- 
nomenon would seem to need no further 
comment. 

Repetition of the Rorschach induced а 
successive and consistent increase in the 
use of “refined control.” Again, one does 
not have to delve into deep dynamics of 
personality to understand this finding: 
Our intelligent subjects merely demon- 
Strated their ability to improve their 
mastery over a challenging situation with 
each successive association with the test 
task. 

Ns remaining measurable influence of 

Petition on the test results is concerned 
with M, human movement responses. On 
taking the test for the second time, our 
sibi manifested greater equilibrium, 
more inner adjustment, a higher level of 
ena айта hii ine wit 
the display oa M oput ume 
situation whi pais GORÎ oe 2 
mini 2 which attended successive ad- 
dme The third administration 

ame test, however, failed to stimu- 


pem. 


h 


EXPERIMENTALLY INDUCED VARIATIONS IN RORSCHACH PERFORMANCE 27 


late our subjects to displdy this sign of 
the superior, well-adjusted person. There 
is a measurable decrease in human move- 
ment responses on the third administra- 
tion. One may speculate that the sub- 
jects were inhibited on the first adminis- 
tration, were less inhibited and more se- 
cure on the next association with the 
blots, hence free to demonstrate optimal 
capacity to absorb and creatively utilize 
stimuli, but were perhaps just bored with 
the third presentation of the same blots, 
bored to the point of not bothering to 
behave optimally. 

The suggestion that boredom attended 
the third testing situation is somewhat 
the spontaneous comments 


supported by i 
ects during the third 


of many of the subj 
session: “These again?” “Tve already 


done this twice.” “Same old thing—a 


butterfly," etc., etc. 

Effect of Examiner A. Among adminis- 
trators, A elicited responses which reveal 
the following functioning of the subjects, 
as indicated by rank order of means: Few- 
est number of responses, shortest average 
least degree of emo- 


time per response, 
social adjustment 


tional adjustment or 1 ne 
(FC), least amount of emotional lability 
(CF), smallest measure of emotionality 
(sumC), least expression of sensitivity 
(Fc), least tendency to use all possible 
environmental stimuli (С), least stimula- 
tion of critical or perfectionist attitudes 
toward concepts (Ad and Hd), lowest fre- 
quencies in all location areas except w% 
wherein A's subjects show the highest 


mean. 


‚ Р п- 
The signs on which A's mean freque 
r suggest 


cies were highest in rank order А 
the following reactions to the effect o 
this administrator on the subjects: Great- 
est amount of crude control (F%), most 
frequent manifestation of unhealthy, un- 
Controlled direct emotional interaction 


with the environment (C), strongest tend- 
ency toward compulsive behavior (W%). 

The pattern which seems to form, as 
one reviews these summarized effects of 
Administrator А on the subject’s Ror- 
schach records, somewhat resembles what 
one might expect of a person confronted 
with a threatening, frustrating situation. 
If this total effect may be considered a 
mirror of the administrator's personality, 
then Examiner A would be described as a 
cold, forbidding, frustrating, threatening 
figure, these personality components per- 
meating the testsituation regardless of 
the deliberate and varied role-playing 
attempted. 

There is no way of checking the valid- 
ity of this thumb-nail sketch of A's per- 
sonality components as reflected in her 
é¥ect on Rorschach records. The writer, 
however, did ask two psychologically so- 
phisticated persons, acquainted with all 
three examiners, to give subjective de- 
scriptions of each. Both described A as 
the coldest, most inflexible, and most 
solid of the examiners. One added the 
adjective "masculine" and the phrase 
“castrating type of female.” 

Effect of Examiner B. Administrator 
B's results fell in the middle position оп 
rank order of means with the majority 
of measures analyzed in this study. The 
relative effect of her personality on the 
administrations is revealed in part, per- 
haps, by the very paucity of extreme posi- 
tions achieved under her administrations. 
Following are the measurable effects: 
smallest amount of introspection (ЕК), 
lowest indication of maturity or creative 
imagination (M), and least evidence of 
bouyancy or free expression of primitive 
drives (FM). 

The subjects, under B's administra- 
tions, achieved rank order of 1 on only 
two significant means of categories: high- 
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est total degree of emotionality (sumC) 
and most refined control (FK + F + 
Fc/R). 

The pattern reflected in the mirror of 
effects suggests that B may possess per- 
sonality traits that are emotionally excit- 
ing to the subjects; at the same time, this 
excitement apparently is well controlled 
and does not provoke undue anxiety or 
tension, neither does it stimulate intel- 
lectual activity. 

It will be remembered that Administra- 
tor B is the youngest of the three ex- 
aminers. She was subjectively described 
as the most feminine of the group, the 
“softest,” most nearly the “ideal protect- 
ing mother-figure." One person described 
her as essentially "seductive." 

Effect of Examiner C. Administrator C 
stimulated in subjects the least amount 
of either crude or refined control (Е, 
and FK + F + Fc/R), the lowest meas- 
ure of uncontrolled emotionality (C), and 
the least indication of "whole compul- 
sion" (W%). 

The means of the significant measures 
showing highest rank order under C's 
administrations are the following: Most 
responses with shortest average time per 
response, most introspection and unfixed 
anxiety (FK and K); best social and emo- 
tional adjustment (FC), most evidence of 
creative imagination (M) and of bouy- 
ancy (FM), highest indication of sensitiv- 
ity (Fc), freest use of all possible stimuli 
(C/), most evidence of ease in relating to 
people (H), greatest evidence of critical 
or perfectionist attitudes toward produc- 
tions (Ad and Hd), greatest use of all 
blot location areas except W% wherein 
C's subjects show the lowest mean, 

The effect of Examiner C on the results 
of the Rorschach are more complex than 
the effects of the other examiners. One 
may speculate that C provided more of 
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an intellectual than an emotional stimu- 
lation to the subjects, 'There would seem 
to be evidence of challenge with attend- 
ant anxiety, but there is also evidence of 
easy rapport, social adjustment, and less 
need for control devices in the situation 
with C. 

C is the eldest of the three examiners. 
She was subjectively described as the most 
flexible of the three, as more feminine 
than A but less than B. One person de- 
scribed her as exuberant, "bubbling," 
the other as “very sympathetic." 

Before leaving these speculations on 
the effects of the various examiners, it 
should be pointed out that Administra- 
tor А quite consistently approximates the 
pattern of the negative administrations 
with sixteen agreements out of a possible 
thirty-one, Administrator В most nearly 
approximates the pattern of the neutral 
administrations with fourteen agreements 
out of thirty-one. Administrator C almost 
duplicates the pattern of the positive ad- 
ministration with twenty-six agreements 
out of the possible thirty-one. 

One may speculate that the basic per- 
sonalities of the examiners, which per 
meated all testing situations regardless of 
the varied affective roles they played, are 
in fact related: A — negative; B — neu 
tral; C = positive, Or one might infer 
that Administrators A and В, for exam- 
ple, in deliberately structuring the post 
tive administrations actually created а 
situation somewhat like that which was 
рк and unintentionally created 
G i ea administrations, that A = 
au dy ых ing to be neutral succeece Е 

à embling B's essential person 
ality, and that B and C in structuring 
negative situations succeeded merely in 
resembling A's basic pattern. The forme? 
interpretation seems more likely. Which- 
“ver Interpretation appears more reason- 


M 


P o I 


EXPERIMENTALLY INDUCED VARIATIONS IN RORSCHACH PERFORMANCE 29 


able, the facts and figures) serve further 
to underline the importance of examiner 
differences. 

Effect of positive administrations. This 
highly permissive situation was deliber- 
ately structured to give subjects the 
feeling of acceptance; the atmosphere was 
one of approval from beginning to end. 
This procedure brought about significant 
changes in the Rorschach behavior of 
the subjects, as has been previously noted. 
Let us summarize the effect of these posi- 
tive administrations: 

More responses were elicited, suggest- 
ing greater productivity in the more per- 
missive situation. These responses tended 
to incorporate all possible stimuli pre- 
sented, There was a greater display of in- 
tellectual activity, of creative imagina- 
tion, plus evidence of increased thinking 
of a popular or communal type. Stereo- 
typy of thinking was markedly low dur- 
ing the positive administrations. There 
was evidence of increased resistance 
against feelings of personal inadequacy 
plus a rise in tension-tolerance and in 
self-questioning, or introspection. Finally, 
there was marked evidence ofa greater 
ease in relating to other human beings. 

Effect of negative administrations. The 
negative test-situations were non-permis- 
sive, rejecting; an atmosphere of disap- 
throughout. The effect 
of this environmental tone on the Ror- 
schach records is reflected in a definite 
increase in stereotypy of thinking and a 
decrease in thinking that is in harmony 
with community thinking. There is @ 
low incidence of intellectual activity of 
the imaginative or creative sort. Too, 
there is evidence of withdrawal from emo- 
tional stimuli, as measured by a decrease 
in the number of responses to the all- 
color cards, There is а marked rise in 
self-questioning, or introspection, plus 
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evidence of resistance against feelings of 
personal inadequacy. 

Effect of neutral administrations. The 
chief value in the experimental design of 
the neutral set of Rorschachs is that of a 
control factor. Since these tests were ad- 
ministered in the standard manner, with- 
out affectively loaded pre-tests, they serve 
somewhat as a measure of the reactions to 
the Rorschach which one might expect to 
find under ordinary clinical situations. 
The results on the neutral Rorschachs, 
however, cannot be construed to have the 
same meaning as is ordinarily attributed 
to the results of control groups in psy- 
chological experimentation. A third of 
our subjects had had one affectively 
loaded Rorschach before experiencing 
the neutral situation; two thirds had had 
tWo affectively loaded experiences with 
the test before receiving the neutral ad- 
ministration. It would be folly to assume 
that all of these neutral administrations 
provided identical test-situations for our 
subjects, that there was no carry-over 
from previous experiences with the ad- 
ministrative tone in which the blots were 
presented. Nevertheless, our neutral ad- 
ministrations do offer a modicum of evi- 
dence as to how subjects react to the test 
when standard testing conditions prevail. 

The neutral situation elicited fewer 
responses than the positive, and these 
responses showed higher stereotypy of 
content than occurred on the positive ad- 
ministrations. There was less interest in 
utilizing all environmental stimuli, di- 
minished ease in inter-personal relation- 
ship, and diminution of intellectual 
creativity on the neutral-as compared 
with the positive—administrations. Lower 
tension-tolerance and less resistance to 
feelings of inadequacy were also appar- 
ent in the neutral, over the positive, ad- 
ministrations. A lower resistance reaction 
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was evident on the neutral administra- 
tions when compared with the negative 
situations. Less introspection was appar- 
ent on the neutral than on the positive 
administrations, and there was greater 
reaction to the emotional stimulation of 
all-color cards. 


B. IMPLICATIONS FOR FURTHER 
INVESTIGATION 


This study does not supply the answer 
to a question which grows out of the find- 
ings: What kind of personality was pos- 
sessed by each of the three administra- 
tors? Since administrator differences ap- 
peared as the greatest contributing factor 
to the variation in the Rorschach records, 
one would like to know what specific and 
measurable factors in an examiner's per- 
sonality will bring about what specific 
and measurable factors in a subject's Ror- 
schach protocol. One wonders if a Ror- 
schach psychogram of the three examin- 
ers would resemble the composite mean 
profiles of the subjects in every way or in 
any way. These problems should be in- 
vestigated. 

The present study was designed in such 
a way that a subject could withdraw with- 
out suffering too great a penalty. Conse- 
quently, several of the individuals who 
reacted intensely to the negative adminis- 
tration withdrew from the project with- 
out finishing the three tests. Their 
records, of course, could not be included 
in this study. This entire experiment 
should be repeated with a group of sub- 
jects over whom the examiner can exer- 
cise greater control. The fact that differ- 
ences were revealed as a function of the 
negative administration makes one won- 
der how much more significant these dif- 
ferences might have been—and what 
additional differences might have ap- 
peared—if the records of the intensely 


affected subjects could have been incor- 
porated in the data. 

The experimental design of the present 
study required that each subject not 
only experience three different affective 
loadings of the. test situation but also 
experience three different examiners. 
Two separate investigations should be 
made as a supplement to the present 
study. It would be well to know whether 
the same kind and amount of fluctuation 
of scoring symbols and M : sumC balance 
would occur if a single examiner tested 
à group of subjects under varied affec- 
tively loaded situations or if several ex- 
aminers successively tested a group of 
subjects under the same affectively loaded 
circumstances. This latter study should, 
in fact, be broken into at least two stu- 
dies, one investigating the results of a 
series of positively loaded situations, and 
the other exploring the results of a series 
of negatively loaded administrations. 

In order to minimize the personal 
equation and thereby to study in purer 
form the influence of negative and post 
tve rapport factors on Rorschach pet 
formances, one could repeat this experi- 
ment w 


ith the following variations: neu- 
tral, 


ral, positive, and negative administra- 
tons could be administered in group 
form, according to the design herein 
used, with directions or instructions given 
by а single examiner through the me 
dium of phonographic recordings. This 
examiner would, of course, have to cut 
three records, each affectively differently 
loaded. While such а procedure could not 
effectively incorporate the particular 
eee Pre-tests used in this study а 

Setters for the administration, other 
X simple devices could be created: 

act, the findings of the present study 
Suggest that no such elaborate preamble 
1S necessary to instill feelings of accep 
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tance or rejection. А c@d reception 
proved adequate to disturb our subjects 
to the point of misunderstandings and 
errors. 

The significant findings for the group 
of male college sophomores justify a repe- 
tition of this entire experiment with 
other age groups and with female sub- 
jects. While our subjects were randomly 
selected, they were not selected from the 
population as a whole but from a rather 
small stratum of a very small and highly 
selected population—college sophomores. 
Our group was shown to deviate from 
population norms in several instances; 
e.g., in compulsive tendencies and pedan- 
tic approach. One may only speculate on 
the extent to which other findings in this 
experiment are a function of the select 


group from which the subjects were 
drawn. Would a similar study, with sub- 
jects representing a cross section of the 
population as a whole, show more or 
less variation, show the same or different 
kinds of instability? 

All of these questions, and many more, 
must eventually be answered in the lab- 
oratory if we are to refine our use of the 
most popular and pragmatically valuable 
of the projective techniques: The Ror- 
schach Ink Blot Test. This study is a 
small step in a direction which must, and 
certainly will, be taken by other students 
who realize that the practical success of 
of the Rorschach must be supported by 
laboratory verification. This area is rich 
in experimental problems sufficient to 
excite at least a generation of students. 

^ 


© 


VI. SUMMARY AND CONCLUSIONS 


His investigation has attempted to ex- 
[ке the stability or variability of 
the numerous response-determinants of 
subjects’ concepts on the Rorschach Ink 
Blot Test, to discover the extent to which 
mere repetition of the test, a change of 
administrators, or an alteration of the af- 
fective tone of the testing situation would 
bring about alterations in number and 
content of responses, in determinants, 
and in location areas. 

Related studies are extremely few and 
their contradictory results are explain- 
able as a function of their decidedly dif- 
ferent experimental designs. A single 
study, that of the Army Air Forces, is 
closely related to the present experiment 
in its analysis of variation in number of 
responses as a function of examiner- 
differences. 

What other response categories are sub- 
ject to fluctuation as a function of 
examiner-differences? If examiners studi- 
ously attempt to structure three differ- 
ent affectively loaded testing situations, 
will there be consistent differences, or 
any differences, as a function of these de- 
liberately varied rapport situations? Will 
mere repetition of the Rorschach signi- 
ficantly alter any of the scoring cate- 
gories, regardless of administrator or of 
affective loading of the administration? 
If so, which scoring categories are sensi- 
tive to such variation? 

Thirty-six male college sophomores be- 
tween the ages of nineteen and twenty- 
seven participated in a series of three 
Rorschach tests, administered without 
testing of the limits. Each test was admin- 
istered by a different female, and each 
testing situation was affectively different- 
ly loaded. One test was given in the 
standard manner without pre-tests and 
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without any attempt to give the subject 
a marked feeling of acceptance or rejec- 
tion. Another test was administered fol- 
lowing two pre-tests during which, by 
manner and tone of voice, the subject 
was made to feel rejected and a failure. 
Still another test was administered after 
a similar pre-test situation during which 
the subject was made to feel accepted and 
successful. Each examiner gave twelve 
tests first, second and third; twelve tests 
positively loaded, negatively loaded, and 
neutral without affective loading. Each 
subject’s different test was administered 
by a different examiner. An equal num- 
ber (twelve) of negative, positive, and 
neutral tests was given on the first, sec 
ond, and third administrations and by 
the three different examiners, This plan 
netted thirty-six Rorschach protocols on 
each of the three administrations, thirty- 
six by each of the different examiners: 
and thirty-six under each of the three dil 
ferent affective loadings of the testing 
situation. The one hundred eight КО! 
schach protocols were scored and sepa- 
rately analyzed according to the number 
of the administration, the administrator: 
and the type of administration. 

On the basis of the analyses of the 
one hundred eight Rorschach records. thé 
following findings are reported: Subject 
who were coldly received tended to m! 
understand, and make errors in following 
directions on simple card-sorting tests- 
Subjects warmly greeted did not mis 
understand, request repetition of, 
make errors in following simple direc 
tions. 

The Experience Balance, or M : sumC 
Tatio, was unstable, only 30%, of the sub” 
Jects maintaining a constant balanc? 
throughout the variations in adminis ^ 


De a 


c 
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tion. The remainder shifted emphasis 
from one side of the ratio to the other 
at least once. 

Теп, thirty-one, and forty-eight ¢-ratios 
were found at the 1 per cent, 5 per cent, 
and 10 per cent levels of confidence re- 
spectively. Of these forty-eight measura- 
ble differences, eight were a function of 
repetition of the test, thirteen a function 
of the variation in affective loading of 
the test situation, and twenty-seven а 
function of examiner differences. 

Of the twenty-three Rorschach ге- 
sponse-categories subjected to statistical 
analysis, three were relatively unaffected 
by the extra-test, or "situational" stimuli: 
Dd + 59%, Fk, and А. The other twenty 
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categories showed frequency differences 
at least once at the 1 per cent, 5 per cent, 
or 10 per cent levels of confidence under 
at least one of the experimentally varied 
situations. 

In conclusion, performance on the Ror- 
schach, as measured by the frequency of 
responses in the Rorschach categories, 
varies significantly with repetition of the 
test, with variation in the negative or 
positive rapport conditions of the ad- 
ministration, and with examiner differ- 
ences. In the present study, the largest 
and most frequent variations in Ror- 
schach performance were associated with 
examiner differences. 


APPENDIX 
EXPLANATION OF SCORING SYMBOLS USED IN THIS STUDY* 


LOCATION 


W Whole blot 

W,S Whole blot and white space used (tabu- 
lated as main W and additional S) 

al detail 

cc used in addition to D (tabu- 
lated as main D and additional S) 

d Small usual detail 

Dd Unusual detail (or unusual combinations of 
usual arcas) 

S White space 


D Large us 
D.S White sy 


CONTENT 
H Human figures 
Hd Parts of human figures, not anatomical 


\ Animal figures 
Ad Parts of living animals 


Bruno and Helen H. David- 
son, The Rorschach Method of Personality Diag- 
nosis, Individual Record Blank (New York: 


World Book Company, 1912). 


* After Klopfer, 


POPULARITY 
P Popular responses 
DETERMINANTS 


R Response 

M Figures in human-like action 

FM Animals in animal-like action 

m Abstract or inaminate movement 

k Shading as three dimensional expanse pro- 
jected on a two dimensional plane 

K Shading as diffusion 

FK Shading as three dimensional expanse in 
vista or perspective 

F Form only, not enlivened 

Fc Shading as surface appearance or texture, 

differentiated 

Achromatic surface color 

Definite form with bright color 

Bright color with indefinite form 


Color only 


[j 


6. 


M 


10. 


. BOCHNER, RUTH, & HALPERN, FLORENCE. 


. BUHLER, 


. FOSBERG, I. 


. Hunt, W. 
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AN EVALUATION OF PERSONALITY-TRAIT RATINGS OBTAINED 
BY UNSTRUCTURED ASSESSMENT INTERVIEWS 


I. PROBLEM, PROCEDURES, AND METHODS 


HE TERM "Interview" as usually de- 

fined means almost any face-to-face 
contact between two or more individuals 
which involves the exchange of informa- 
tion, When the purpose of the interview 
is to obtain information about the per- 
son being interviewed, it is usually called 
a diagnostic or appraisal interview. It is 
with this type of interview that this in- 
vestigation is concerned, 


As pointed out by Fearing (6, 7), the appraisal 
interview is "probably one of the oldest human 
social techniques," and is also "the least studied, 
the most constantly used, and the most fre- 
quently challenged method of securing social 
data. 

Studies of the reliability of judgments based 
on the interview (2, 7. 0, 10, 14, 15), although 
extremely divergent with respect to method of 
interview and the type of data selected for analy- 
sis demonstrate that these judgments will vary in 
reliability according to: (a) the person doing the 
interview; (b) the amount of objective data 
available to the interviewer before the inter 
and (c) the degree of structuring or standardiza- 
tion of the interview itself. 

The validity of judgments based on the inter- 
view technique has been less adequately studied 
than has the reliability of interview judgments. 
In 1931, Symonds (17) pointed out that inter- 
viewing “has not been subjected to experimental 
scrutiny or statistical validation.” A survey of 
the literature by the present writer did not indi- 
cate that the situation has been changed since 
1931. In none of the studies (1, 4. 5. 10, 18) was 
any estimate made of the contribution of the 
interview itself to the validity of the interviewer's 
judgments. In every case the validities given 
were for judgments based on the interview 
plus other material. As a group, the validity 
studies indicated that little evidence has been 
upport the belief, prevalent among 
and psychologists as well as per- 
sons in general who deal with other people, that 
face-to-face contact in an interview situation 1s 
very necessary (and sometimes sufficient) for meas- 
uring personality traits and predicting behavior. 


gathered to s 
psychiatrists 


The present investigation is an at- 


tempt to determine the validity of per- 


sonality-trait ratings based on two types 
of unstructured interview situations and, 
further, to determine the increase in 
validity of interview ratings over that of 
ratings made by the same individuals 
without benefit of an interview. The 
data analyzed in this study were gathered 
during the 1947 Michigan Assessment 
Program." If in some instances the data 
appear incomplete or the results incon- 
clusive it should be understood that the 
Assessment Program was not designed? 
to thoroughly investigate any particular 
technique but rather to determine the 
maximum validity of many techniques 
combined. 
A. SUBJECTS 

The sample investigated in this study 
consisted of 128 male college graduates 
who had been accepted by various uni- 
versities for training in clinical psychol- 
ogy leading to the Ph.D. degree under 
the Veterans Administration training 
program. All were beginning graduate 
students accepted at the P-1 level—the 
lowest professional grade in.the U. S. 
Civil Service system. 


1'The 1947 Michigan Assessment Program was 
conducted as a part of the Research Project on 
the Selection of Clinical Psychologists sponsored 
by the Veterans Admin tion under a contract 
with the University of Michigan. For a detailed 
report of the overall aims and research design 
of this project, the reader is referred to the 
project's Preliminary Report issued in Decem- 
ber, 1948 (11). 

*The writer, although associated with the 
project since September, 1946, had no part in 
the planning or experimental design of the 1947 
Michigan Assessment Program and wishes to 
accept no credit therefor. Neither does he wish 
to be held responsible for certain justifiable 
criticisms of the design such as the lack of 
estimates of the reliability of interview ratings 
and the lack of independence between interview 
ratings and criterion ratings. 


N 


B. THE RATERS 

The subjects were studied intensively 
over a period of one week by a staff of 
thirty clinicians. Two of these were 
psychiatrists. The majority of the others 
were professional clinical psychologists, 
some being advanced graduate students 
in clinical psychology. Ratings made by 
these clinicians comprise the data of this 
investigation, 


C. THE RATING SCALE 


The rating scale used in the Assessment 
Program comprises 42 variables divided 
into a Scale A (a group of 22 bi-polar 
so-called surface traits—supposedly the 
more manifest dimensions of personal- 
ity, a Scale B (a group of 9 so-called 
source traits—supposedly the more under- 
lying dimensions of personality), and & 
Scale C (a group of 11 so-called criterion 
variables, which in reality аге various 
skills believed necessary for successful 
functioning as a clinical psychologist). 
In addition, ratings were made on Lik- 
ing (Variable Жо), defined simply as ex- 
tent of personal like or dislike for the 
subject. In rating students on Scales A 
and B the staff was instructed to rate the 
person as he was at the time of assess- 
ment. In rating on Scale C the student 


was to be rated as he would be five years 


in the future (i.e, when he would have 
one year of professional experience past 
the Ph.D. degree). Definitions of Scales 
B and C variables are given in the Ap- 
pendix. 

АП three rating scales were the prod- 
uct of joint thinking on the part of the 
project planning committee and were 
the outgrowth of trial scales used in 
preliminary assessments the year before. 
Scale A was based in part on the findin 


of Cattell (8) in his factor analyses 
personality ratings. 
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In using the(rating scales, raters were 
instructed to use as a frame of reference 
"all first year clinical psychology gradu- 
ate students at universities accredited by 
the American Psychological Association 
to offer training in clinical psychology." 
Raters were asked to make their ratings 
conform roughly to a normal distribution 
with suggested frequencies designated as 
3, 7 15, 25, 25, 7, and 8 per cent, respec- 
tively, for each point from 1 to 8. 


D. Tue INTERVIEWERS 

A staff team of three members studied 
each student intensively. This team con- 
sisted of an Initial Interviewer, an In- 
tensive Interviewer, and either a ‘Test 
Integrator or a Projective Integrator. 
Each staff team studied four students 
from each class of 24 students. Sixteen 
staff member functioned as Initial and 
Intensive Interviewers interchangeably 
so that for each student team of four stu- 
dents, one staff member would act as 
Intensive Interviewer for two students 
and as Initial Interviewer for the other 
two students. The group of 128 students 
studied in this investigation thus received 
two sets of interview ratings—one sct 
made after the Initial interview (herein- 
after designated as Init ratings) and one 
set made after the Intensive interview 
(hereinafter called Intens ratings). Since 
the 16 interviewers acted in both the 
Initial and Intensive interviewer capacity 
for the 128 students (although no staff 
member functioned as both Intensive 
and as Initia] Interviewer for the same 
student), the problem of interviewer dif- 
ferences does not arise when comparisons 
are made between ratings based on the 
Intensive interview situation and ratings 
based on the Initial interview situa- 
ton. Each of the interviewers inter- 
viewed from two to twelve students in 
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the role of Initial Intgyviewer and a 
similar number in the role of Intensive 
Interviewer. 

All 16 interviewers (2 psychiatrists and 
14 professional clinical psychologists) had 
had considerable interviewing experience 
before assessment. Without doubt they 
may be regarded as being more skilled 
at uncovering personality dynamics by 
use of the unstructured interview than 
the general population of interviewers. 
In fact all could undoubtedly be termed 
expert interviewers. 


E. Tue INITIAL INTERVIEW SITUATION 


The Initial Interview was an unstruc- 
tured type of interview lasting approxi- 
mately one hour. The material covered in 
this interview was purposely kept at а 
fairly superficial level so that little 
anxiety would be aroused in the subjects. 
The Initial Interviewer had available to 
him before the interview only the infor- 
mation about the subject usually found 
in a credentials file: i.e. application 
blank (Civil Service Form 57), letters of 
recommendation, and records of past 
performance in the form of college tran- 
scripts. The Initial interviewer is prob- 
ably comparable to the usual college in- 
terview for the purpose of determining 
admission elegibility, and is similar to 
the typical interview (although somewhat 
longer in duration) given applicants for 


employment in industry. 


F. THE INTENSIVE INTERVIEW SITUATION 


The Intensive interview 
structured. It lasted approximately two 
hours. The Intensive Interviewer had 
available before the interview the wealth 
of objective, projective, and autobio- 
graphical material described in detail be- 


low under Design of Assessment (Section 
terview was 


I, G, 1). The Intensive in 


was also un- 


deep and probing in nature: the inter- 
viewer attempted to uncover underlying 
personality dynamics in order to con- 
firm hypotheses suggested by his review 
of the data available on the student, or 
to fill gaps which persisted in spite of 
all the data available. The Intensive 
interview is probably comparable to the 
interview used for diagnostic purposes 
with a neuropsychiatric patient by a psy- 
chiatrist who has available to him the 
social history, letters from friends and 
relatives, and psychological test reports. 


G. THE DESIGN or ASSESSMENT 


Lack of space prevents the presenting 
of the experimental design of the 1947 
Michigan Assessment Program in detail. 
However, some information concerning 
the assessment is desirable so that the role 
of the interviews may be seen in proper 
perspective. Therefore, a typical student 
("X") will be followed through the as- 
sessment program. In addition, the vari- 
ous assessment ratings will be described 
in some detail. 


1. The students’ assessment schedule. Students 
came to the assessment center їп groups of 20 
to 24 for periods of seven days. Soon after his 
arrival at assessment headquarters, “X” and his 
classmates were given an orientation by the 
Project Director, at which time the aims of 
assessment were reviewed and the students as- 
sured that personal material would be kept in 
strictest professional confidence. "X" was also 
told that since certain staff members would be 
rating him on the basis of certain specified 
materials he would have absolutely no contact 
with those staff members until after those par- 
ticular ratings had been made. The students 
were then assigned to teams of four. 

During his first several days of assessment, 
"X" took a battery of paper-and-pencil objective 
tests (Miller Analogies Test, Chicago Test of 
Primary Mental Abilities [Single Booklet Edi- 
tion], Allport-Vernon Study of Values, Strong 
Vocational Interest Blank, Minnesota Multi- 


3For a complete description of the assess- 
ment see the Preliminary Report (11) referred 
to earlier. 
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phasic Personality Inventory, and the Guilford- 
Martin Battery—Inventory of Factors STDCR, In- 
ventory of Factors GAMIN, and Personnel In- 
ventory I): Scores on these instruments were 
made available to the assessment staff. The asses- 
sce was given a projective test battery consisting 
of the Rorschach, the Bender-Gestalt, a sentence 
completion test, and ten cards of the Thematic 
Apperception Test. He filled out a biographical 
inventory (consisting of 131 items), a psychologi- 
cal experience record (for aid in evaluating ob- 
jective and projective tests), and wrote from an 
outline a detailed autobiography (average length 
about 20 pages), He was interviewed by an Initial 
Interviewer, and later interviewed again by an 
Intensive Interviewer. 

On the fifth day of assessment, "X" was put 
through a period of situation or role-playing 
tests designed to bring out salient features of the 
student's interactions with others He was ob- 
served and rated at this stage by his own staff 
team (the two interviewers and the Projective 
or Test Integrator), and was also observed and 
rated by another staff team who had never seen 
him before and who had no information avail- 
able about him. That evening, all students and, 
staff relaxed with a social gathering. 

After breakfast on the last morning, "X" was 
asked to fill out a sociometric questionnaire 
(typical questions: “Who would you most like 
to go on a camping trip with?” and “Which 
student would you prefer to be supervised by in 


‘In addition to these objective tests, the stu- 
dents took the Kuder Preference Record, the 
ACE General Culture Test, and the Strong 
Vocational Interest Blank twice more—once with 
instructions to fill it out as it would be answered 
by “women in general” and finally as it would 
be answered by “men in general.” Scores on 
these tests were not made available to the as- 
sessment staff but were filed away for later 
independent analyses. 

"The situation tests were patterned after 
the psychodramatic (role-playing) technique of 
Moreno. For a complete description апа thor- 
ough analysis of situation tests as used in 
the 1947 Michigan Assessment Program, the 
reader is referred to an article by Dr. William 
Soskin (17). One typical situation test involved 
two subjects, one of whom was informed in ad- 
vance that he was to play the role of a small- 
town high school principal who had called 
one of his male instructors in for a conference 
regarding the instructor's rumored sexual mis- 
conduct. The other subject was informed that 
he was to play the role of a high school instruc- 
tor who had been told the principal wished 
to talk to him but was not informed what the 
conference was about. The ensuing five or ten 
minutes of role-playing consistently called forth 
aspects of the students' personalities that sur- 
prised and enlightened even the participants. 


your work?" concerning his reactions to his 
classmates. In addition, he was asked to rate 
himself and his three teammates on Scales A, 
B, and C (using a slightly modified form of the 
staff rating form). Finally he was asked to 
Prepare concise but frank character sketches 
about his teammates, 

When these tasks were completed, "X" had a 
final appointment with his intensive interviewer. 
Although he hoped to obtain from that inter- 
View the staff's opinion of him, he went away 
satisfied with a small amount of relatively in- 
nocuous data, such as test scores, 

Finally, “X” and his classmates met for a 
group-therapeutic session conducted by a visiting 
psychologist (who had taken no other part in 
the assessment program) and a farewell talk by 
the Project Director, who thanked the group 
for their cooperation and repeated the assur- 
ance that assessment findings would be kept 
in strictest confidence, 


2. Ratings made during assessment. 
The preceding section indicates the wide 
variety of psychological techniques used 
in assessment. Ideally, the program 
would have been designed so that cach 
technique could be evaluated separately, 
with staff members rating independently 
each student on each technique, Un- 
fortunately, the expense of such a design 
(in terms of staff and student time às 
well as money) was prohibitive. As а 
compromise it was decided to make rat- 
mgs on various combinations of psy- 
chological data and to design the assess- 
ment program so that even though esti- 
mates of the validity of ratings based on 
all techniques might not be obtained di- 
rectly, it would be possible to estimate 
the increase in validity of ratings ор- 
tained with the addition of different psy- 
chological даға, Wherever practical, in- 


"Immediately after each set of ratings was 
made, the ratings were filed away in the project 
oce and мете not again made available to 
until the Final Pooling Con- 
» Notes made by any rater were 
d Be rater nor were ve 
СОЧ with oth il the 
Preliminary Conference, "hus the peek 
example, when making ratings 
ve interview had available only 
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dependent ratings on Various separate 
psychological techniques were obtained 
as well. In general, the psychological 
data were utilized in order of increasing 
cost; crcdentials material costing only a 
postage stamp was presented to the raters 
first; objective tests costing only slightly 
more were presented second, etc. 

The following ratings were made dur- 
ing assessment: 

a. Ratings based on each of the four 
projective techniques made by the staff 
member administering the particular 
technique. 

b. Ratings based on the four projec- 
tive protocols plus interpretations of the 
protocols. These ratings were made by 
the Projective Integrators on half the 
students. 

c. Ratings made on the other half of 
the students by the Test Integrators. 
These ratings were based on all projec- 
tive material and in addition on all cre- 
dentials material, all autobiographical 
material and all objective test data. 

d, Ratings made by the Interviewers. 

(1) PreInit ratings. These ratings 
were made by the staff member serving as 
Initial Interviewer prior to the interview 
—i.e., before the raters had seen the sub- 
jects. These ratings were based entirely 
on the material contained in the Cre- 
dentials folders. А 

(2) Init ratings. These ratings 
were made by the Initial Interviewers 
immediately after the Initial interviews. 
Thus, they were based on Credentials 
materials plus the information obtained 
during the Initial interview. 


(3) PreIntens ratings. These rat- 


ings were the fourth set of ratings made 


i e 
the written test material and ше поза fu 
himself had made during the inten pe 
no knowledge of the ratings ог the op 
any other person. 


by the Intensive Interviewers before see- 
ing the subjects. The Intensive Interview- 
ers had previously rated on Credentials 
material alone; then on Credentials plus 
Objective test data; then on Credentials 
plus Objectives plus Autobiographical 
material. The PreIntens ratings were 
based on Credentials material, Objec- 
tive test data, Autobiographical material, 
and Projective protocol material. 

(4) Intens ratings. These ratings 
were made by the Intensive Interviewers 
immediately after the Intensive inter- 
view. Thus, they were based on the data 
described under PreIntens ratings above 
plus the information obtained during 
the Initial interview. 

e. Ratings decided upon at the Pre- 
liminary Pooling Conference which oc- 
curred after the Intensive Interview. The 
team of three staff members (the two 
interviewers and the Projective or Test 
Integrator) assigned to the student partic- 
ipated in this conference. АП the data 
referred to above were available for this 
conference as were the notes taken by 
any of the three staff members prior to 
the conference. None of the ratings made 
previously on the student either by the 
three staff members or by any other staff 
members were available for the confer- 
ence. 

f. Ratings based on the Situation tests. 
There were of two types: The three sets 
of individual Contaminated Situation 
ratings made by the student's own staff 
team, and three sets made by another 
staff team who knew nothing about the 
student and thus were forced to base 
their ratings solely upon impressions 
gained from the Situation tests. In addi- 
tion, this latter staff team held a Situa- 
tion Pooling Conference at which they 
arrived at a set of Uncontaminated 
Pooled Situation ratings. 
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g. The Final Individual ratings. On 
the last morning of assessment, each 
member of the student's staff team re- 
viewed all material available on the stu- 
dent (including anything significant he 
might have gleaned during the social 
gathering following the Situation tests), 
and made a set of Final Individual rat- 
ings. 

h. Ratings by the students. On the 
last morning of assessment each student 
rated himself and his three teammates. 

i. The FinP ratings. During the last 
afternoon of assessment the student's 
staff team met for a Final Pooling Con- 
ference at which was made available not 
only all psychological material gathered 
during assessment, but all ratings that 
had been made on or by the student. The 


FinP ratings arrived at during this coi 


ference thus represent the combined 
judgment of three clinicians who had in- 
tensively studied the subject for seven 
days and who had available a wealth 
of psychological data. 


H. ASSESSMENT DATA USED IN THIS 
INVESTIGATION 

The PreInit, Init, PreIntens and 
Intens ratings described in the preceding 
section comprise the ratings whose validi- 
ties are estimated in this investigation. 
These ratings were validated against the 
FinP ratings. The cstimated reliabilities 
of the FinP ratings are presented in 
Table 1 (in Section IL, Results). Un- 
fortunately, the assessment design did 
not allow for any estimate of the relia- 
bilities of the predictor ratings. 

* The other three sets of ratings made by the 
Intensive Interviewers before the interview (see 
Section G,2,d,3 above) are being analyzed else- 
where. Their analyses are not included here 
because these ratings did not seem especially 
relevant to the main purposes of this study—the 
investigation of the validity of ratings based on 


interviews and increase in validity which can be 
attributed to the interviews. 
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The present(paper is concerned only 
with the validity of ratings on Scales B 
and C. Scale А was not rated by either 
of the interviewers prior to the interview, 
and to present the validities of the in- 
terview ratings for Scale A would add 
nothing to the conclusions drawn from 
the investigation. 


I. THE CRITERION MEASURES 


Since criterion measures are of para- 
mount importance in any validity inves- 
tigation (and this is especially true in the 
area of personality measurement where 
good criteria are rare indeed), the FinP 
ratings should be examined in detail to 
determine whether they are acceptable 
criteria of the personality variables rated. 

The estimated coefficients of reliability 
of the FinP ratings (Column 5 of Table 
1) on the Scale В variables ranged from 
-73 to .go (median .87); those on Scale C 
from .88 to .93 (median .90). These re- 
liabilities, while not as high as is desir- 
able for individual evaluations, are satis- 
factory for group measurements. Cer- 
tainly, they are of such magnitude as to 
permit evaluation of predictive devices; 
i.e., failure of any predictor to correlate 
highly with the Finp ratings can only 


be interpreted as lack of v 


alidity of the 
predictor. 


In addition, as noted above, the FinP 
Taüngs represent the combined and 
pooled judgments of three staff members, 
selected on the basis of professional com- 


petence in the field of clinic 


al holog 
de al psychology 


psychiatry, who had intensively 
studied each subject for 


reek ki 

week, making use of a 

psychologic 
Consider 


a period of one 
wide variety of 
al techniques and materials. 
ing these facts, there would 
seem to Бе little doubt that the FinP 


ratings of the personality variables mak- 


ing up Scale В are about as valid cri- 
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terion measures of these jariables as are 
attainable from trained clincians with 
present techniques. 

The Scale C variables are a different 
matter. Scale C variables are not person- 
ality traits but rather predictions regard- 
ing future performance (five years later) 
in various aspects of the job of Clinical 
Psychology; Variable C42 was an ap: 
praisal of the overall suitability of the 
student for the ficld of clinical psychol- 
ogy. It is not believed, therefore, that 
Scale С FinP ratings are acceptable as 
criterion measures, суеп though they 
may represent “better guesses” than rat 
ings made earlier in the assessment pro- 
gram, For this reason comparisons be- 
tween ratings on Scale C variables based 
on the interview situations and FinP 
ratings will be considered merely as 
agreements between two sets of ratings. 

One factor which must be considered, 
if the FinP ratings are to be used as cri- 
terion measures, is the factor of con- 
tamination of the FinP ratings. Of the 
three persons comprising the staff team 
who made the FinP ratings, two were the 
interviewers whose ratings are here being 
investigated. If the role of either In- 
tensive Interviewer or Initial Interviewer 
was regarded consistently by the other 
two team members as being а “better 
role for judging personality characteris- 
tics, then that interviewer would have a 
dominance in the pooling conference 
Which would tend to influence the other 
The effect of this 


two team members. ] 
nsistently bring 


influence would be to co! : 
the FinP ratings closer to the ratings 
made by the interviewer than would be 
Warranted by the actual merit of those 
ratings. Dominance by individuals apart 
from their roles may be disregarded since 
all interviewers acted in the dual capac: 
ity of Intensive and Initial Interviewers 
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on the same staff team. Staff members 
were rotated from week to week so that 
for each student class the staff teams were 
made up of different combinations of 
staf members; this also tended to reduce 
the effect of dominance by any particular 
individuals. 

Detailed evidence on the effect of such 
role the estimates of 
validity of the interviewer's ratings will 
be presented elsewhere (18). The evi- 
dence, while not conclusive, does indicate 
that role dominance was present; that 
the role of Intensive Interviewer was be- 
lieved by the other staff members to be 
especially advantageous for judging per- 
sonality variables; but that the effect of 
such dominance on the FinP ratings was 
slight. The possibility that Intensive In- 
térviewer validity coefficients are spuri- 
ous will be considered when the results 
of this investigation are discussed in 
Section ПІ. 


dominance on 


J. METHOD or DETERMINING VALIDITY OF 
RATINGS BASED ON INTERVIEWS 


To determine the validity of ratings 
on the various traits made by the inter- 
viewers either before or after the inter- 
view, the rating given each subject by the 
interviewer on a particular variable was 
plotted against the criterion rating given 
that subject on the same variable. Pear- 
son product-moment correlation coeffici- 
ents were then computed from the scatter 
plots, using the conventional raw-score 
formula. The various sets of correlation 
coefficients obtained are given in Table 
1 appearing in the following chapter. 

То determine the contribution of the 
interview itself to the validity of the rat- 
ings made after the interview—that is, 
the contribution of the interview apart 
from the material available to the inter- 
viewer—correlation coefficients. between 
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for Scale В and .28 for Sgale C. Appar- 
ently differences in reliability of the FinP 
ratings are only slightly related to differ- 
ences in correlations between Init rat- 
ings and FinP ratings. 

In order to test the significances of 
differences between the Jnit-FinP r's for 
all 20 variables, the y's were first con- 
verted into Fisher's z-functions by use of 
а table given by Lindquist (12, p. 212), 
and the significance of the difference be- 
tween each pair of z's was estimated. АП 
possible combinations of the go variables 
taken two at a time amount to 189 pairs 
of differences. Thirty-eight of these dif- 
ferences are significant (P = .o5 or less). 
By chance one would expect to find only 
five differences out of each hundred. (or 
nine out of the 189) reaching the .05 level 
of significance. The conclusion seems 
justified that differences do exist in the 
validities with which the different vari- 
ables were rated by the Init interviewer.? 

9. The contribution of the interview 
to the validity of ratings made after the 
Initial interview. Comparison of the Pre- 
Init-FinP correlations (based on ratings 
before the interview) with the Init-FinP 
correlations. (based on ratings after the 
interview) provides an opportunity to 
estimate the incremental contribution 
of the Initial interview to the validity of 
the post-interview ratings. If the PreInit- 
FinP т (Column 1 of Table 1) and the 
Init-FinP r's (Column 2 of Table 1) are 
squared, the differences between these 


3 This conclusion is rendered even more m. 
able by the fact that some шо [ooa 
expected to be present between ере € um 
the rs on which the 25 are base: k. EL 
computed on the same sample of үс репе 
are not independent), but the amot mo e 
correlation is unknown and thus’ was an ae 
estimating the standard errors of us i s: п < 
The effect of ignoring such corre ae E 
overestimate the standard errors of me os 
ences and thus to underestimate the significan: 


of the differences. 


725 provide estimates of the amount of 
FinP rating variance accounted for by 
the nit ratings which can be attributed 
to the Initial interview. However, the 
FinP ratings differed in reliability and it 
would seem that these "gain in variance 
accounted for" figures might be more 
representative of the contribution of the 
interview were they corrected in some 
manner for FinP reliability differences. 
Therefore, an Init Net Gain figure was 
computed by subtracting the PreInit- 
ГЕР v? to obtain a “possible gain in vari- 
ance accounted for" figure by which the 
"gain in variance accounted for" figure 
was divided. It is these relative Init Net 
Gains which are listed in Column 8 of 
Table 1.* 

For illustrative purposes, an example 
oPthe computation of the Init Net Gain 
figure for Variable B23 follows: From 
Table 1, it may be seen that for Vari- 
able Beg, the PreInit-FinP + is .og and 
the Znit-FinP r is .38. The difference be- 
tween the squares of these 7’s is .1440, 
which can be interpreted as the amount 
of the variance of the FinP ratings on 
B23 accounted for by the Init ratings 
which can be attributed to the Initial 
interview. The reliability of the FinP 
ratings on B23 is .9o. Subtracting from 

* Partial correlation coefficients could have 
been used to determine the net contribution of 
the interview to the validity of the post-inter- 
view ratings, but when both pre-interview and 
post-interview validity coefficients are large, a 
greater difference between the first order 7’s is 
necessary to obtain the partial r which would 
be obtained by a smaller difference between 
first order r's when their magnitude is smaller. 
Partial correlation coefficients thus would cause 
an underestimation of the relative contribution 
of the interview when the pre-interview ratings 
correlated higher with the criterion ratings. The 
net gains are not influenced by the magnitudes 
of the r's, and in addition, by taking account of 
the reliability of the FinP ratings, do not penal- 
ize the interviewer's ratings on those variables 


where, because of lower FinP reliabilities, the 
maximum Zntens-FinP correlation possible is 


actually less. 
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this the square of the PreInit-FinP + 
(.0004), the “possible gain in variance ac- 
counted for" figure .8996 is obtained. Di- 
viding .1440 by .8996 results in the Init 
Net Gain figure .16, which is given in 
Column 8 of Table 1. 

Examination of Column 8 indicates 
that there are appreciable differences in 
the Init Net Gains between the different 
variables, but interpretation of these dif- 
ferences is difficult. The two variables 
with highest net gains are B25 and B26, 
which were among the five variables 
(18) on which the Final Individual rat- 
ings of the Initial Interviewer had the 
highest (of the three team members) cor- 
relations with the FinP ratings, so role 
dominance may be a contributing factor. 
A common core underlying the variables 
(B24, B28, Beg, C33, and C37) whith 
gained relatively least from the Initial 
interview becomes apparent when it is 
considered that these variables all have 
fairly high loadings on the same factor:— 
one which was tentatively defined as so- 
cial intelligence, or the ability to effec- 
tively use intelligence in interpersonal 
relations. Apparently the credentials 
folder furnished some clues to general 
intelligence (college transcripts, work 
history, etc.) which allowed the raters to 
rate these variables with some success 
before the interview. The interviewer 
was able to obtain little information dur- 
ing the Initial interview which added to 
the estimate of intelligence already made, 
and little other information which was 
of help in estimating how effectively the 
subjects were able to use their intelli- 
gence in social situations, 


° This factor was obtained 
factor analysis of the interc 
FinP ratings of Scales В ап. 
the Research Project on the 
Psychologists. 


in an unpublished 
correlations between 
d C carried out b 
Selection of Clinical 


' B. THE [NTENSIVE INTERVIEW 

The mean ratings assigned by the In- 
tensive Interviewers after the interview 
ranged from 3.89 to 5.54 (median mean 
rating 4.65). The standard deviations of 
the Intens ratings varied from 1.14 to 
1.63 (median standard deviation 1.38). 

1. Validity of ratings made before the 
Intensive interview. The correlations be- 
tween the PreIntens ratings (the ratings, 
made by the Intensive Interviewers just 
before the interview, based on Cre- 
dentials, Objective test data, Autobio- 
graphical material, and Projective pro- 
tocols), and the FinP ratings appear in 
Column g of Table 1. With one exception 
(Variable B25) the correlations are sig- 
nificant at or beyond the .o1 level of sig- 
nificance. As was found true for the Pre- 
Init ratings, the variables most validly 
rated by the Intensive Interviewer before 
the interview are B31, C32, C36, and C39 
—all variables which are concerned with 
intellectual performance. 

2. Validity of ratings made after the 
Intensive interview. In Column 4 of 
Table 1 are listed the validities of rat- 
ings made by the Intensive Interviewer 
after the Intensive interview, In Column 
7 appear the validities which would have 
resulted had the FinP ratings been per- 
fectly reliable, 

АП of the Intens-FinP correlations at 
tain significance beyond the .o1 level. 

It should be noted that correcting the 
Intens-FinP т for unreliability of the 
FinP ratings raised the median r from 
-61 to .67 for Scale B, and from .68 to -7} 
for Scale C, Even had the FinP ratings 
been perfectly reliable the Intens ratings 
would have done only a slightly better 
job of predicting the FinP ratings. The 
Tank difference correlation coefficient 
rho, computed between the rank orders 


= 


^ 
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of the uncorrected r's ang the ?'s cor- 
rected for attenuation is equal to .97, in- 
dicating that the unreliability of the 
FinP ratings had little effect on the rank 
order of the validities with which differ- 
ent variables were rated after the Inten- 
sive interview. However, differences in 
the reliability with which the variables 
were rated by the staff team during the 
Final Pooling Conference may be related 
to differences in the validity with which 
the variables were rated by the Inten- 
sive Interviewer. The rho’s between the 
rank orders of the FinP reliability coeffi- 
cients and the rank orders of the Intens- 
FinP correlations are equal to .69 for 
Scale В and .19 for Scale C, suggesting 
that for Scale B, trait differences in In- 
tensive Interview validities would be 
smaller were the FinP ratings of equal 
reliability. 

The Intens-FinP rs were converted 
into z's and tested for significance of dif- 
ferences between them in the manner de- 
scribed for the Init-FinP rs. Thirty-eight 
pairs of differences of the total of 189 
pairs were found to be significant at the 
.o5 level, indicating that differences do 
exist in the correlations between the 
Intens and FinP ratings. 

3. The contribution of the interview 
to the validity of ratings made after the 
Intensive interview. The relative Net 
Gain figures for the Intensive interview 
(obtained by the method described for 
the Initial interview—Section П, A, 3 
above) are shown in Column 9 of Table 
1. The variables (B25, B30, C35, C38, 
C4o, and C42) to which the Intensive 
interview makes the greatest relative con- 
tribution seem to be those concerned 
with social relations with others in the 
professional work situation. The vari- 
ables to which the Intensive interview 


contributes very little (B23, B27, Bag, 
C32, and C36) appear to be variables con- 
cerned with effective intelligence or in- 
lellectual efficiency. B23 (Social Adjust- 
ment) does not fall into this class, but 
this variable is relatively broad in defini- 
tion and has many aspects which render 
it poorly ratable on the basis of a face- 
to-face conversation. Also, even rela- 
tively poorly adjusted individuals might 
be able to put up a facade for the inter- 
view period. 

Role dominance may be a contribut- 
ing factor to the apparent contribution 
of the Intensive interview. That is, those 
variables which show relatively high Net 
Gains may do so, not because of any 
contribution of the interview itself, but 
because the other two team members, be- 
lieving the Intensive Interviewer to be 
the best informed on these variables, de- 
[erred to his judgments when determin- 
ing the FinP ratings. In fact, five of the 
six variables with the highest relative 
Intens Net Gains were variables for 
which the Final Individual ratings (just 
before the Final Pooling Conference) 
made by the Intensive Interviewer cor- 
related higher with the FinP ratings than 
did the Final Individual ratings made 
by either of the other two staff members 
(18). Of the five variables with the lowest 
relative Intens Net Gains only two were 
variables for which the Final Individual 
ratings of the Intensive Interviewer had 
higher r’s with the FinP ratings. 


C. COMPARISON OF THE INTENSIVE AND 
INITIAL INTERVIEWS 


1. Agreement between ratings made 
after the interviews. In Column 10 of 
Table 1 are listed the correlations be- 
tween the Init ratings and the Intens rat- 
ings. Thirteen of the 20 7's were signifi- 
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cant at the .o1 level. Two facts seem of 
especial interest here. First, the median r 
for Scale B is .19, while that for Scale C is 
.31. Apparently, the two interviewers 
agreed better with each other on ratings 
of performance five years in the future 
than they did on ratings of present per- 
sonality traits. Second, for no variable 
did the ratings made after the interviews 
correlate with each other to a greater 
extent than either set of ratings cor- 
related with the FinP ratings. In addi- 
tion, it may be noted that the variables 
(B31, C32, and C36) on which the ratings 
made after the interviews showed the 
greatest agreement are among those 
which were most validly rated before the 
interview by both the Initial and the In- 
tensive Interviewers and which gained 
least in terms of relative net gains frém 
either of the interviews. That is, those 
variables to the rating of which the inter- 
views contributed relatively little are the 
very ones on which there is most agree- 
ment after the interviews. 

2. Comparison of the Interviewers’ rat- 
ings with respect to correlation with the 
FinP ratings. In Columns 1 through 4 
of Table 4 appear four sets of correla- 
tions between Interviewers’ ratings and 
FinP ratings: 

The PreInit-FinP 7's (Column 1) are 
the correlations between the FinP ratings 
and ratings (based on Credentials ma- 
terial only) made by the Initial Inter- 
viewer before the interview, 

The Init-FinP т (Column 2) are cor- 
relations between the FinP ratings and 
ratings by the Initial Interviewer 
the one-hour Initial interview. 

The PreIntens-FinP yg ( 
are correlations between the 
and ratings by the Intensive 
just prior to the interview. 
these ratings, the Inte 


er after 


Column 3) 
FinP ratings 
Interviewer 
ew. In making 
nsive Interviewer 


had availablepCredentials material, Ob- 
jective test data, Autobiographical ma- 
terial, and Projective protocols. 

The Intens-FinP үз (Column 4) are 
correlations between the FinP ratings 
and ratings by the Intensive Interviewer 
after a two-hour Intensive interview. 

These four sets of correlations permit 
of six comparisons, each of which yields 
some interesting information. 

a. The comparative validities of pre- 
interview ratings based on different 
amounts of written matcrial. When the 
PreInit-FinP correlations are compared 
with the PreIntens-FinP, it is evident 
that the latter are appreciably higher for 
all variables. When it is considered that 
the PreInit ratings were based on Cre- 
dentials material while the PreIntens rat- 
ings were based on Credentials material 
plus considerable other psychological 
data, it seems indicated that the validity 
of personality-trait ratings will vary with 
the amount of psychological data made 
available to the raters, 

‚_ b. The comparative validities of rat- 
Ings made after the Initial and Intensive 
interview situations. The Intens-FinP 
correlations were consistently higher 
than the Init-FinP correlations. Whether 
the relatively greater validity of ratings 
after the Intensive interview situation 
can be attributed to the greater amount 
iid psychological data available to the In: 
tensive Interviewer, or to the greater 
length and depth of the Intensive inter 


view, ог to both factors, is a matter 0 
speculation, Th 


€ evidence presented be 
low ( 


2W (Section II, C, 3) when the inte 
views are compared with respect to rela- 
tive net gains suggests that most, if not 
all, of the apparent superiority of the 
Intensive interview situation is a result 
of the greater amount of written dat? 
available before the interview. 
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с. The comparative vélidities of rat- 
ings made before and after the inter- 
views. Four comparisons may be made 
here. Two of these, the comparison of the 
validities of ratings before and after the 
Initial interview and before and after 
the Intensive interview have been made 
indirectly in the sections (II, A, 3, and 
II, B, 3, above) wherein the material con- 
cerning the met gains in validity at- 
tributable to the interview was pre- 
sented, so it need be only stated here 
that for all variables, ratings made after 
each type of interview correlated higher 
with the FinP ratings than did ratings 
made before that interview. The third 
comparison, that between PreInit-FinP 
correlations and Jntens-FinP correlations, 
merits little consideration since it merely 
substantiates the rather obvious hypoth- 
esis that ratings based on a thorough 
interview plus a comprehensive mass of 
written psychological data are more valid 
than ratings based on only a little written 
data without benefit of an interview. 

The last comparison, that between 
Init-FinP correlations and PreIntens 
FinP, seems of such importance that it is 
worthy of consideration separately. 

d. Comparison of ratings made after 
the Initial interview with ratings made 
before the Intensive interview. Propo- 
nents of the appraisal interview frequent- 
ly state that the value of interviews lies in 
their flexibility. The argument goes that 
in the interview situation it is possible 
to follow up leads; to confirm or reject 
hypotheses regarding the personality 
dynamics of the interviewee; and thus to 
form much more accurate judgments 
than are possible on the basis of test data 
alone. Test scores, on the other hand, 
are rigid and inflexible. Even projective 
devices offer only hints to possible dy- 
namics. Thus, test material may have 


some value in assessing the more super- 
ficial surface traits, but only by talking 
to the subject, by asking him questions, 
and by listening to his answers, may the 
underlying, more fundamental, source 
traits be accurately appraised. 

Comparison of the Init-FinP correla- 
tions (Column 2, of Table 1) with the 
PreIntens-FinP correlations provides at 
least a partial test of this belief. The 
Init ratings were based on a one-hour 
interview (plus the background material 
available in the Credentials file). The 
PreIntens ratings were based entirely on 
written data (with no opportunity to 
talk to, or even see, the subject). The 
Scale B traits are source traits and hence 
supposedly more validly rated on the 
basis of an interview. 

“This belief is not upheld by the present 
study. The median Znit-FinP correlation 
for Scale B traits is .42; the median 
PreIntens-FinP correlation is .49. The 
Init ratings are more valid for two of 
the Scale B traits: the PreIntens ratings 
are more valid for seven. 

Approximately the same relationship 
existed for the Scale C variables. The 
median Init-FinP r is .46; the median 
PreIntens-FinP т is .57. The PreIntens | 
ratings are more accurate for nine of 
the eleven variables. 

Thus, ratings based on only test and 
biographical data are more valid than 
ratings based on an interview plus the 
information contained in a credentials 
folder. 

3. Comparison of the interviews with 
respect to relative net gains. When the 
relative net gains in correlations between 
ratings which can be attributed to the 
interviews (Columns 8 and 9 of Table 1) 
are compared, the Intensive Interviewer 
scems to gain slightly more from the 
interview than does the Initial Inter- 
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viewer, although tie difference is neither 
statistically nor practically significant. 
The median met gain is .24 for the In- 
tensive Interviewer and .16 for the Ini- 
tial Interviewer. The fact that the Inten- 


sive Intervief*er appears to have been 
considered by the other two staff mem- 
bers as being in a somewhat better posi- 
tion to rate many of the variables renders 
cven this small difference questionable. 


(9 


Hl. DISCUSSION, CONCLUSIONS, AND SUGGESTIONS 
FOR FURTHER RESEARCH 


HE PRESENT investigation. indicates 
TS. interviews apparently do con- 
tribute to the validity with which person- 
ality variables are rated (i.e., the assess- 
ment raters typically made more valid 
ratings on the basis of an interview plus 
psychological data than they did on the 
basis of psychological data alone). That 
contribution is slight, however, and may 
be even less than has been demonstrated 
here. It should be remembered that two 
of the three criterion team members 
were the interviewers whose ratings 
were studied. Were the criterion meas- 
ures truly independent—had none of the 
criterion team members been inter- 
viewers—it does not seem likely that the 
validity of the interviewers’ ratings 
would have been as high as was found in 
this study. On the other hand, it is pos- 
sible that had the interviews occurred 
first—with psychological data presented 
to the interviewer after he had already 
rated on the basis of the interview—their 
validity coefficients might have been 
higher, with the psychological test data 
yielding only a slight additional validity 
to the interview ratings already made. 
‘That is, the psychological data when pre- 
sented first as in this investigation may 
have given the interviewers a set so that 
they were unable to change their ratings 
after the interview. It scems unlikely 
that any such persistence of initial im- 
pressions actually did decrease the effec- 
tive validity of the ratings after the in- 
Lerviews for these reasons: (1) The Ini- 
tial Interviewers, who had a minimum of 
information (only the cre- 
dentials files) available before the inter- 
view, did not gain more from the inter- 
View than did the Intensive Interviewers 


psychological 


who had much psychological data avail- 
able before the interview and thus had 
opportunity to form stronger initial im- 
pressions. (э) Ratings based on unstruc- 
tured appraisal interviews, unless used 
in conjunction with some sort of objec- 
tive psychological data, have been shown 
to be unreliable, and hence invalid. Addi- 
tional research is necessary for a definite 
answer to this question. It is unfortunate 
that the assessment program could not 
have been designed to answer this and 
the similar questions arising when the 
validity of any of the many assessment 
techniques is investigated, but, of course, 
allspossible permutations of the various 
techniques would require several hun- 
dred summers of assessments, Since but 
one assessment was possible it was actu- 
ally designed on the basis of increasing 
cost of technique. Thus, credentials cost- 
ing only postage were made available to 
the staff first, objective tests second, the 
interviews (fairly expensive since they 
required both students and interviewers 
to be present) later, and last the situation 
tests requiring groups of students and 
groups of staff members, 

When the two types of interview situa- 
tions are compared with respect to valid- 
ity of ratings made by the interviewers 
after the interviews, it is apparent that 
the Intensive Interviewer was typically 
able to make ratings which were more 
valid than were ratings made by the Ini- 
tial Interviewer. When the two types of 
interview situations are compared with 
respect to relative net gains this relation- 
ship still exists, but the difference be- 
comes statistically insignificant. In addi- 
tion, when it is considered that the In- 
tensive Interviewer may have been con- 
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sidered by the other two team members 
to be the person in the best position to 
rate, a large portion of this difference in 
validity would seem to disappear. 

There appears, then, to be little differ- 
ence in the true relative contributions of 
the two types of interview situations. The 
Intensive interview, which was two hours 
in length and in which an attempt was 
made to probe for underlying personality 
dynamics, contributed little, if any, more 
than did the Initial interview, which 
lasted only one hour and which was os- 
tensibly devoted to the gathering of 
rather superficial information. 

Since differences in the interviews 
themselves cannot account for the differ- 
ences in validity of ratings after the two 
interview situations, these differences 
would seem to be functions of the differ- 
ences in the amount of psychological 
test data available to the interviewers for 
study before the interview. This is 
further evidenced by the fact that the 
PreIntens validities were higher than the 
PreInit validities. 

ОЁ equal significance is the fact that 
ratings based on the one-hour interview 
plus credentials (the Init ratings) cor- 
related less well with the criterion ratings 
than did the PreIntens ratings—ratings 
which were based on a varied and com- 
prehensive set of psychological data but 
without any interview. Apparently, 
skilled clinicians can use fairly complete 
psychological test data in the rating of 
personality traits slightly more effectively 
then they can use an interview, even 
when the interview is aided by historical 
wii in the form of credentials, Un- 
mad: dde wie ars t a 

oh x a y. The median va. 
lidity of the Inil ratings for all twent 
variables is .45 and the median validity 
of the PreIntens ratings is .49. Such cor- 


relation coeffitients account for less than 

25% of the variance in the criterion rat- 
ings, and indicate the predictor ratings to 
be only about 25% more valid than 
chance ratings—this despite the fact “that 
the criterion ratings are based on judg- 
ments of the same persons who inter- 
preted the psychological data and con- 
ducted the interviews. If the Initial inter- 
view situation is in fact (as suggested in 
Section I, E) comparable to the typical 
personnel selection or college admission 
interview, it would appear these inter- 
views have little actual value, 

Combination of comprehensive psy- 
chological data and a two-hour probing 
interview results in Intensive interview 
ratings whose median correlation with 
the criterion ratings is about .63, account- 
ing for about 4097, of the criterion vari- 
ance, This is a sizeable increase over the 

20% accounted for by ratings based on 
psychological data alone, but still leaves 
the greater percentage of the criterion 
rating variance unaccounted for—a ре!“ 
centage which would probably be still 
greater were there not a personnel over- 
lap between the persons making the pre 
dictor and the criterion ratings. 

, Validity coefficients of .65 are con 
sidered fairly satisfactory for group tests 
used for selection purposes, but for indi- 
vidual personality assessment somewhat 
higher validities are needed. Especially 
is this true in the areas of clinical PSY’ 
chology and psychiatry where decisions 
asta айкы, to clients and pa 
deon the de arae базе a 
psychological iren ^ esses сс mp 

ata than were available 1” 


the present instance, 
A. CONCLUSIONS 


" The conclusions to be drawn from this 
vestigation have been implicit through 


w” "ed 


ia 
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out this and the preceding chapter. Ex- 
plicitly stated they are: 

1. The contribution of the unstruc- 
tured appraisal interview to the validity 
of personality-trait ratings based on an 
interview plus psychological data is 
slight. 

2. Longer interviews with the objec- 
tive of uncovering personality dynamics 
contribute little, if any, more to the 
validity of personality-trait ratings than 
do shorter interviews whose main objec- 
tive is the eliciting of information. 

3. The more comprehensive the psy- 
chological data available, the more valid 
will be personality-trait ratings based on 
that data. 

4. Personality-trait ratings based on 
fairly comprehensive psychological test 
data are more valid than are ratings 
based on interviews with only a little 
psychological data in the form of creden- 
tials material available. 

5. At the present time, even skilled 
clinicians, having available for study and 
integration a wide variety of objective 
test data, projective protocols, creden- 
tial material, and autobiographical data, 
plus data accruing from a probing face- 
to-face interview, do not appear able to 
make personality assessments, in the form 
of ratings of personality traits, which 
have sufficient validity to be as useful in 
individual case work as is desirable. 


B. SUGGESTIONS FOR FURTHER RESEARCH 


- This investigation, although it repre- 
sents a definite contribution to the study 


of the validity of personality-trait ratings 
based on interviews, in that it provides 
an estimate of the increment in validity 
which may be attributed to the inter- 
view itself, has left at least three impor- 
tant questions unanswered: 

1. The previously mentioned problem 
of whether the presentation of psycholog- 
ical data before the interview formed a 
“set” in the interviewer which prevented 
his post-interview ratings being changed 
sufficiently to reflect the true validity of 
the interview. 

2. The interview as a selection instru- 
ment could not be evaluated here since 
the Final Pooled ratings of Scale C vari- 
ables did not appear to be acceptable as 
selection criteria. A similar study is neces- 
sary to determine the incremental valid- 
ity of interview ratings of job-success 
variables when those ratings are evalu- 
ated against acceptable criteria of job- 
success. 

3. Although this investigation was 
able in some measure to estimate the in- 
cremental validity of the interview when 
used to assess personality variables, it is 
probable that such incremental validity 
has been overestimated inasmuch as the 
criterion measures themselves were deter- 
mined by the persons who originally 
made the interview ratings. It would 
seem desirable, then, that another study 
be carried out so that the incremental 
validity of the interview in the assessment 
of personality-traits might be evaluated 
against truly independent criteria. 


APPENDIX 
RATING SCALE DEFINITIONS 


ScALE B (Variables Nos. 23-31) 

Note: For ratings on this scale, 1 — left side or low, 8 — right side or high. 

Since many of the following attributes (223-30) are broad factors, it is unlikely that any person 
will fit all the phrases grouped together at one pole of a given variable. Note also that for some 
items neither extreme necessarily represents a desirable attribute. 

23. Social Adjustment: How well docs he adjust to varied interpersonal situations? (Includes sexual 
adjustment only as it affects social adjustment in general.) š 


Acts without consideration for feelings of others; 
often rejected by others, often appears aloof, 
hostile, or irritable. 


Actively considers feelings of others; readily 
gains acceptance in interpersonal relationships: 
maintains a friendly and likcable manner. 


24. Appropriateness of Emotional Expression: How appropriate are his emotional responses to the 


situation? 


Fails to adapt his emotional responses to the 
needs of the situation; shows disorganized or 
overly constricted emotional responses. 


Shows emotional responses of a quality and 

intensity befitting the situation; rcacts spontane- 

ously but appropriately; shows well-integratcd 

and flexible patterns of emotional behavior. 
25. Characteristic Intensity of Inner Emotional Tension: How i is his i ional life 
š P : inten iona 

as inferred from all available clues? seris ИБН 


Inner emotional life characterized by a minimám H 
ya as s y ional drives r 1 
of persistent internal tensions, e$ Strongly repressed emotional drives result 
ing in inner turmoil; great inner conflict. and 
Strong pent-up emotions, 


26. Sexual Adjustment: To what degree do his sexual needs and activities affect his overall adjust- 


ment? 


His sexual fieeds and activities Seriously inter- His Sexual nceds and activities c 

8 ; ” Ж ha 

А i 1 1 d viti lefinitely enl 
hi erall adjustment, his overall a ljustment 


27. Motivation for Professional Status: How strong is his driy 


fessional career? 
28. Motivation for Scientific Understandin 
i g of People: How stri се و‎ vin 
facts, theories, and skills ; aan w strong arc his drives toward acquiring 
Insight into Others: Sor cain Тү a understanding of individual human ‘beings? 
jos 1 ` oes ci i = iya- 
tions of others? 5 he have into the attitudes, emotions, and motiva 


€ for the status-rewards of a pro 


Interprets behavior at its face value: j in 
to ws but gross T m ea du m Sood awareness of underlying dynamics of 
i ч = , behavior; i: карам ез ЕА 
not develop any integrated 1 + 15 Sensitive to s ances of behav 
behavior ei of Bs on а of ae responses; is able rae ы un- 
N i Д erstanding of the A 
go. Insight into Himself: How much insight does he h 8 of the behavior of people. 
own attitudes, emotions and motivations? i 
31. Quality of Intellectual Accom 


the underlying dynamics of his 
plishmenis: W! ; 
output? hat is the characteristic quality 


of his intellectual 
Intellectual work is characteristicall 
8 у of low Cha e 
GAE racteristic; 
ын ‘gh quality, 
SCALE C—CnrrrRIO; 
а N SKILLS (Vari 
Note: For ratings on this scale, y — low,8 — high (Variables Nos. 32-42) 
Ratings on No. 32 refer to рег in ° 
M. uci i» Ls р formance in Braduate school: rati t's 
l years hence (i.e., after опе year of experience TAUnBS on Nos. 33-42 refer to studen 
What will be his level of competence or skill in u НЫ pan AS БАШ; 
8. е Performance (during next three or [e oe oF 
Effectively master course 5): How wi i : 
psychology. die and еннен; Successtully complete course Ned на; inical 
‘ology, Я elated fields; satisfy language теш S in general psychology, clinica 
examinations. © “86 requirements for the doctorate: pass gencral 


ally produces intellectual work of 
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33- Clinical Diagnosis: How Pell will he: 

Recognize dynamics underlying particular responses in both objective and projective tests, observe 
significant interrelationships among responses, relate findings to case history and other clinical data. 

Flicit from the patient information required for mental status examinations and case histories; 
ascertain and evaluate attitudes and incidents of psychological significance in the patient. Synthesize 
clinical findings to arrive at an integrated picture of personality development, structure, and func- 
tion. E 

34 Individual Psychotherapy: How effectively will he: Conduct various types of individual psycho- 
therapy. 

35. Group Psychotherapy: How effectively will he: Conduct various types of group psychotherapy. 

36. Research: How well will he: 

Recognize and define important research problems in clinical psychology; critically evaluate and 
apply the research findings of others; think with originality and scientific rigor; employ appropriate 
experimental design and statistical methods; grasp practical implications of findings; present results 
and conclusions in clear, comprehensive, and well-organized form. 

37. Administration: How well will he: 

Plan and develop psychological programs; make proper administrative decisions; delegate responsi- 
bility appropriately; elicit cooperation from subordinates and superiors; maintain high morale among 
his staff; carry out or direct an appropriate public relations program. 

38. Supervising Clinical Psychologists: How well will he: 

Carry out the professional supervision of subordinates assigned to him for duty and on-the-job 
instruction; assign their duties; evaluate their performance; instruct them in clinical techniques; 
perform other aspects of in-service training. 

80. Teaching Psychology (in a College or University); How well will he: 

Teach college courses in general psychology; motivate students; present concepts and procedures; 
stimulate critical thinking about and integration of c@urse materials; evaluate the products of learn- 
ing. 

40, Professional Interpersonal Relations: How well will he: 

Work cooperatively with superiors, subordinates, members of the mental team, and other pro- 
fessional personnel concerned with the patient's welfare; participate in the give-and-take of staff 
conferences; contribute to group decisions. 

41. Integrity of Personal and Professional Behavior: How well will he: 

Recognize and fulfill professional responsibilities; live up to personal commitments; show loyalty 
to professional obligations in the event of outside pressure or promise of personal gain; maintain 
discretion concerning professional matters; appropriately conform with commonly accepted standards 
of moral and social behavior; refrain from coloring facts, evasion, lying, etc. 

2. Overall Suitability for Clinical Psychology: In view of his assets and liabilities, how well will 
he be able to: 

Carry out the several duties—diagnosis, therapy, and research—specified for the position of clinical 
psychologist (P-4 and above) in the Veterans Administration. 
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